TO MY FRIEND


WALTER DALLENBACH 




FROM THE AUTHOR’S PREFACE TO 
THE FIRST GERMAN EDITION 


The importance of the standpoint afforded by the theory
of groups for the discovery of the general laws of 
quantum theory has of late become more and more 
apparent. Since I have for some years been deeply concerned 
with the theory of the representation of continuous groups, it 
has seemed to me appropriate and important to give an account 
of the knowledge won by mathematicians working in this field 
in a form suitable to the requirements of quantum physics. An 
additional impetus is to be found in the fact that, from the 
purely mathematical standpoint, it is no longer justifiable to 
draw such sharp distinctions between finite and continuous 
groups in discussing the theory of their representations as has 
been done in the existing texts on the subject. My desire to 
show how the concepts arising in the theory of groups find their 
application in physics by discussing certain of the more important 
examples has necessitated the inclusion of a short account of the 
foundations of quantum physics, for at the time the manuscript 
was written there existed no treatment of the subject to which 
I could refer the reader. In brief this book, if it fulfills its 
purpose, should enable the reader to learn the essentials of the 
theory of groups and of quantum mechanics as well as the rela- 
tionships existing between these two subjects ; the mathematical 
portions have been written with the physicist in mind, and vice 
versa. I have particularly emphasized the “ reciprocity ” be- 
tween the representations of the symmetric permutation group 
and those of the complete linear group ; this reciprocity has as 
yet been unduly neglected in the physical literature, in spite of 
the fact that it follows most naturally from the conceptual 
structure of quantum mechanics. 
There exists, in my opinion, a plainly discernible parallelism
between the more recent developments of mathematics and 
physics. Occidental mathematics has in past centuries broken 
away from the Greek view and followed a course which seems 
to have originated in India and which has been transmitted, 
with additions, to us by the Arabs ; in it the concept of number 
appears as logically prior to the concepts of geometry. The 
result of this has been that we have applied this systematically 
developed number concept to all branches, irrespective of whether 
it is most appropriate for these particular applications. But 
the present trend in mathematics is clearly in the direction of a 
return to the Greek standpoint ; we now look upon each branch 
of mathematics as determining its own characteristic domain 
of quantities. The algebraist of the present day considers the 
continuum of real or complex numbers as merely one “ field ” 
among many ; the recent axiomatic foundation of projective 
geometry may be considered as the geometric counterpart of 
this view. This newer mathematics, including the modern 
theory of groups and “ abstract algebra,” is clearly motivated 
by a spirit different from that of “ classical mathematics,” which 
found its highest expression in the theory of functions of a 
complex variable. The continuum of real numbers has retained 
its ancient prerogative in physics for the expression of physical 
measurements, but it can justly be maintained that the essence 
of the new Heisenberg-Schrödinger-Dirac quantum mechanics is
to be found in the fact that there is associated with each physical 
system a set of quantities, constituting a non-commutative 
algebra in the technical mathematical sense, the elements of 
which are the physical quantities themselves. 


Zürich, August, 1928



AUTHOR’S PREFACE TO 
THE SECOND GERMAN EDITION 

During the academic year 1928-29 I held a professorship
in mathematical physics in Princeton University. The 
lectures which I gave there and in other American insti- 
tutions afforded me a much desired opportunity to present anew, 
and from an improved pedagogical standpoint, the connection 
between groups and quanta. The experience thus obtained has 
found its expression in this new edition, in which the subject 
has been treated from a more thoroughly elementary standpoint. 
Transcendental methods, which are in group theory based on 
the calculus of group characteristics, have the advantage of 
offering a rapid view of the subject as a whole, but true under- 
standing of the relationships is to be obtained only by following 
an explicit elementary development. I may mention in this 
connection the derivation of the Clebsch-Gordan series, which is 
of fundamental importance for the whole of spectroscopy and 
for the applications of quantum theory to chemistry, the section 
on the Jordan-Hölder theorem and its analogues, and above all
the careful investigation of the connection between the algebra 
of symmetric transformations and the symmetric permutation 
group. The reciprocity laws expressing this connection, which 
were proved by transcendental methods in the first edition, as well 
as the group-theoretic problem arising from the existence of spin 
have also been treated from the elementary standpoint. Indeed, 
the whole of Chapter V — which was, in the opinion of many 
impossible to avoid presenting the principal part of the theory
of representations twice ; first in Chapter III, where the repre- 
sentations are taken as given and their properties examined, 
and again in Chapter V, where the method of constructing the 
representations of a given group and of deducing their properties 
is developed. But I believe the reader will find this two-fold 
treatment an advantage rather than a hindrance. 

To come to the changes in the more physical portions, in 
Chapter IV the role of the group of virtual rotations of space 
is more clearly presented. But above all several sections have 
been added which deal with the energy-momentum theorem of 
quantum physics and with the quantization of the wave equation 
in accordance with the recent work of Heisenberg and Pauli. 
This extension already leads so far away from the fundamental 
purpose of the book that I felt forced to omit the formulation 
of the quantum laws in accordance with the general theory of 
relativity, as developed by V. Fock and myself, in spite of its 
desirability for the deduction of the energy-momentum tensor. 
The fundamental problem of the proton and the electron has 
been discussed in its relation to the symmetry properties of the 
quantum laws with respect to the interchange of right and left, 
past and future, and positive and negative electricity. At 
present no solution of the problem seems in sight ; I fear that 
the clouds hanging over this part of the subject will roll together 
to form a new crisis in quantum physics. I have intentionally 
presented the more difficult portions of these problems of spin 
and second quantization in considerable detail, as they have 
been for the most part either entirely ignored or but hastily 
indicated in the large number of texts which have now appeared 
on quantum mechanics. 

It has been rumoured that the “ group pest ” is gradually 
being cut out of quantum physics. This is certainly not true 
in so far as the rotation and Lorentz groups are concerned ; 
as for the permutation group, it does indeed seem possible to 
avoid it with the aid of the Pauli exclusion principle. Never- 
theless the theory must retain the representations of the per- 
mutation group as a natural tool in obtaining an understanding 
of the relationships due to the introduction of spin, so long as 
its specific dynamic effect is neglected. I have here followed the
trend of the times, as far as justifiable, in presenting the group-
theoretic portions in as elementary a form as possible. The 
calculations of perturbation theory are widely separated from 
these general considerations ; I have therefore restricted myself 
to indicating the method of attack without either going into 
details or mentioning the many applications which have been 
based on the ingenious papers of Hartree, Slater, Dirac and 
others. 

The constants c and h, the velocity of light and the quantum 
of action, have caused some trouble. The insight into the 
significance of these constants, obtained by the theory of rela- 
tivity on the one hand and quantum theory on the other, is 
most forcibly expressed by the fact that they do not occur in 
the laws of Nature in a thoroughly systematic development of 
these theories. But physicists prefer to retain the usual c.g.s.
units — principally because they are of the order of magnitude of 
the physical quantities with which we deal in everyday life. 
Only a wavering compromise is possible between these practical 
considerations and the ideal of the systematic theorist ; I 
initially adopt, with some regret, the current physical usage, 
but in the course of Chapter IV the theorist gains the upper 
hand. 

An attempt has been made to increase the clarity of the 
exposition by numbering the formulae in accordance with the 
sections to which they belong, by emphasizing the more im- 
portant concepts by the use of boldface type on introducing 
them, and by lists of operational symbols and of letters having 
a fixed significance. 

H. WEYL. 


Göttingen, November, 1930




TRANSLATOR’S PREFACE 


This translation was first planned, and in part completed,
during the academic year 1928-29, when the translator 
was acting as assistant to Professor Weyl in Princeton. 
Unforeseen delays prevented the completion of the manuscript 
at that time, and as Professor Weyl decided shortly afterward 
to undertake the revision outlined in the preface above it seemed 
desirable to follow the revised edition. In the preparation of 
this manuscript the German has been followed as closely as 
possible, in the conviction that any alterations would but de- 
tract from the elegant and logical treatment which characterizes 
Professor Weyl’s works. While an attempt has been made 
to follow the more usual English terminology in general, this 
programme is limited by the fact that the fusion of branches of
knowledge which have in the past been so widely separated as 
the theory of groups and quantum theory can be accomplished 
only by adapting the existing terminology of each to that of 
the other ; a minor difficulty of a similar nature is to be found 
in the fact that the development of “ fields ” and “ algebras ” 
in Chapter V is accomplished in a manner which makes it appear 
desirable to deviate from the accepted English terminology. 

It is a pleasure to express my indebtedness to Professor Weyl 
for general encouragement and assistance, to Professor R. E. 
Winger of Union College for the assistance he has rendered in 
correcting proof and in preparing the index, and to the publishers 
for their cooperation in adhering as closely as possible to the 
original typography. 

H. P. ROBERTSON 


Princeton, September, 1931 
INTRODUCTION


The quantum theory of atomic processes was proposed by
Niels Bohr in the year 1913, and was based on the
atomic model proposed earlier by Rutherford. The
deduction of the Balmer series for the line spectrum of hydrogen
and of the Rydberg number from universal atomic constants
constituted its first convincing confirmation. This theory gave 
us the key to the understanding of the regularities observed in 
optical and X-ray spectra, and led to a deeper insight into the 
structure of the periodic system of chemical elements. The issue 
of Naturwissenschaften, dedicated to Bohr and entitled “Die
ersten zehn Jahre der Theorie von Niels Bohr über den Bau
der Atome” (Vol. 11, p. 535 (1923)), gives a short account of the
successes of the theory at its peak. But about this time it began 
to become more and more apparent that the Bohr theory was 
a compromise between the old “classical” physics and a new 
quantum physics which has been in the process of development 
since Planck’s introduction of energy quanta in 1900. Bohr 
described the situation in an address on “ Atomic Theory and 
Mechanics ” (appearing in Nature, 116 , p. 845 (1925)) in the 
words: “From these results it seems to follow that, in the 
general problem of the quantum theory, one is faced not with 
a modification of the mechanical and electrodynamical theories 
describable in terms of the usual physical concepts, but with 
an essential failure of the pictures in space and time on which 
the description of natural phenomena has hitherto been based.”
The rupture which led to a new stage of the theory was made 
by Heisenberg, who replaced Bohr’s negative prophecy by a 
positive guiding principle. 

The foundations of the new quantum physics, or at least 
its more important theoretical aspects, are to be treated in this
book. For supplementary references on the physical side,
which are urgently required, I name above all the fourth edition 
of Sommerfeld’s well-known “Atombau und Spektrallinien”
(Braunschweig, 1924), or the English translation “Atomic
Structure and Spectral Lines” (London, 1923) of the third
edition, together with the recent (1929) “Wellenmechanischer
Ergänzungsband” or its English translation “Wave Mechanics”
(1930). An equivalent original English book is that of Ruark
and Urey, “Atoms, Molecules and Quanta” (New York, 1930),
which appears in the “International Series in Physics,” edited
by Richtmeyer. I should also recommend Gerlach’s short
but valuable survey “Experimentelle Grundlagen der
Quantentheorie” (Braunschweig, 1921). The spectroscopic data,
presented in accordance with the new quantum theory, together
with complete references to the literature, are given in the
following three volumes of the series “Struktur der Materie,”
edited by Born and Franck:

F. Hund, “Linienspektren und periodisches System der
Elemente” (1927);

E. Back and A. Landé, “Zeemaneffekt und Multiplettstruktur
der Spektrallinien” (1925);

W. Grotrian, “Graphische Darstellung der Spektren von
Atomen und Ionen mit ein, zwei und drei Valenzelektronen”
(1928).

The spectroscopic aspects of the subject are also discussed 
in Pauling and Goudsmit’s recent “The Structure of Line
Spectra” (1930), which also appears in the “International
Series in Physics.”

The development of quantum theory has only been made 
possible by the enormous refinement of experimental technique, 
which has given us an almost direct insight into atomic 
processes. If in the following little is said concerning the 
experimental facts, it should not be attributed to the mathe- 
matical haughtiness of the author; to report on these things 
lies outside his field. Allow me to express now, once and for 
all, my deep respect for the work of the experimenter and for 
his fight to wring significant facts from an inflexible Nature, 
who says so distinctly “No” and so indistinctly “Yes” to
our theories. 
Our generation is witness to a development of physical
knowledge such as has not been seen since the days of Kepler, 
Galileo and Newton, and mathematics has scarcely ever 
experienced such a stormy epoch. Mathematical thought 
removes the spirit from its worldly haunts to solitude and
renounces the unveiling of the secrets of Nature. But as 
recompense, mathematics is less bound to the course of worldly 
events than physics. While the quantum theory can be traced 
back only as far as 1900, the origin of the theory of groups 
is lost in a past scarcely accessible to history ; the earliest 
works of art show that the symmetry groups of plane figures 
were even then already known, although the theory of these 
was only given definite form in the latter part of the eighteenth 
and in the nineteenth centuries. F. Klein considered the 
group concept as most characteristic of nineteenth century 
mathematics. Until the present, its most important application 
to natural science lay in the description of the symmetry of 
crystals, but it has recently been recognized that group theory 
is of fundamental importance for quantum physics ; it here 
reveals the essential features which are not contingent on a
special form of the dynamical laws nor on special assumptions 
concerning the forces involved. We may well expect that it is 
just this part of quantum physics which is most certain of a 
lasting place. Two groups, the group of rotations in 3-dimensional
space and the permutation group, play here the principal
role, for the laws governing the possible electronic configurations 
grouped about the stationary nucleus of an atom or an ion are 
spherically symmetric with respect to the nucleus, and since the 
various electrons of which the atom or ion is composed are 
identical, these possible configurations are invariant under a 
permutation of the individual electrons. The investigation of 
groups first becomes a connected and complete theory in the 
theory of the representation of groups by linear transformations, 
and it is exactly this mathematically most important part 
which is necessary for an adequate description of the quantum 
mechanical relations. All quantum numbers, with the exception
of the so-called principal quantum number, are indices character- 
izing representations of groups. 
This book, which is to set forth the connection between groups
and quanta, consists of five chapters. The first of these is 
concerned with unitary geometry. It is somewhat distressing
that the theory of linear algebras must again and again be 
developed from the beginning, for the fundamental concepts 
of this branch of mathematics crop up everywhere in mathe- 
matics and physics, and a knowledge of them should be as 
widely disseminated as the elements of differential calculus. 
In this chapter many details will be introduced with an eye 
to future use in the applications ; it is to be hoped that in 
spite of this the simple thread of the argument has remained 
plainly visible. Chapter II is devoted to preparation on the 
physical side ; only that has been given which seemed to me 
indispensable for an understanding of the meaning and methods 
of quantum theory. A multitude of physical phenomena, which 
have already been dealt with by quantum theory, have been 
omitted. Chapter III develops the elementary portions of the 
theory of representations of groups and Chapter IV applies them 
to quantum physics. Thus mathematics and physics alternate
in the first four chapters, but in Chapter V the two are fused
together, showing how completely the mathematical theory is
adapted to the requirements of quantum physics. In this last
chapter the permutation group and its representations, together
with the groups of linear transformations in an affine or unitary
space of an arbitrary number of dimensions, will be subjected to
a thorough-going study.
CHAPTER I


UNITARY GEOMETRY 

§1. The n-dimensional Vector Space 

The mathematical field of operation of quantum mechanics,
as well as of the theory of the representations of groups, 
is the multi-dimensional affine or unitary space. The 
axiomatic method of developing the geometry of such a space 
is no doubt the most appropriate, but for the sake of clearness 
I shall at first proceed along purely algebraic lines. I begin 
with the explanation that a vector $\mathfrak{x}$ in the $n$-dimensional
linear space $\mathfrak{R} = \mathfrak{R}_n$ is a set of $n$ ordered numbers
$(x_1, x_2, \cdots, x_n)$; vector analysis is the calculus of such
ordered sets. The two fundamental operations of the vector calculus
are the multiplication of a vector $\mathfrak{x}$ by a number $a$ and the
addition of two vectors $\mathfrak{x}$ and $\mathfrak{y}$. On introducing the notation

$$\mathfrak{x} = (x_1, x_2, \cdots, x_n), \quad \mathfrak{y} = (y_1, y_2, \cdots, y_n)$$

these operations are defined by the equations

$$a\mathfrak{x} = (ax_1, ax_2, \cdots, ax_n), \quad \mathfrak{x} + \mathfrak{y} = (x_1 + y_1, x_2 + y_2, \cdots, x_n + y_n).$$
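As an illustrative sketch (not part of the original text), the two fundamental operations can be written out in Python, with a vector represented as a tuple of numbers. The function names `scale` and `add` are assumptions chosen for this example.

```python
def scale(a, x):
    """Multiply the vector x (a tuple of numbers) by the number a."""
    return tuple(a * xi for xi in x)


def add(x, y):
    """Add two vectors of the same dimension component by component."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same number of components")
    return tuple(xi + yi for xi, yi in zip(x, y))


# Hypothetical sample vectors in a 3-dimensional space.
x = (1, 2, 3)
y = (4, 5, 6)
scaled = scale(2, x)   # (2*1, 2*2, 2*3)
summed = add(x, y)     # (1+4, 2+5, 3+6)
```

Representing vectors as immutable tuples keeps the sketch close to the "ordered set of numbers" of the text; any numeric type that supports `+` and `*` may serve as the components.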

The fundamental rules governing these operations of multiplica- 
tion by a number and addition are given in the following table 
of axioms, in which small German letters denote arbitrary 
vectors and small Latin letters arbitrary numbers : 

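(α) Addition.

1. $\mathfrak{a} + \mathfrak{b} = \mathfrak{b} + \mathfrak{a}$ (commutative law).

2. $(\mathfrak{a} + \mathfrak{b}) + \mathfrak{c} = \mathfrak{a} + (\mathfrak{b} + \mathfrak{c})$ (associative law).

3. $\mathfrak{a}$ and $\mathfrak{c}$ being any two vectors, there exists one and only one
vector $\mathfrak{x}$ for which $\mathfrak{a} + \mathfrak{x} = \mathfrak{c}$. It is called the difference
$\mathfrak{c} - \mathfrak{a}$ of $\mathfrak{c}$ and $\mathfrak{a}$ (possibility of subtraction).

(β) Multiplication.

1. $(a + b)\mathfrak{x} = (a\mathfrak{x}) + (b\mathfrak{x})$ (first distributive law).

2. $a(b\mathfrak{x}) = (ab)\mathfrak{x}$ (associative law).

3. $1\mathfrak{x} = \mathfrak{x}$.

4. $a(\mathfrak{x} + \mathfrak{y}) = (a\mathfrak{x}) + (a\mathfrak{y})$ (second distributive law).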

The existence of a vector $\mathfrak{0} = (0, 0, \cdots, 0)$ with the property

$$\mathfrak{x} + \mathfrak{0} = \mathfrak{0} + \mathfrak{x} = \mathfrak{x}$$

need not be postulated separately as it follows from the axioms. 

Affine vector geometry concerns itself entirely with concepts
which are defined in terms of the two fundamental operations
with which the axioms (α) and (β) are concerned; we mention
a few of the most important. A number of vectors $\mathfrak{a}_1, \cdots, \mathfrak{a}_h$
are said to be linearly independent if there exists between them
no homogeneous linear relation

$$c_1\mathfrak{a}_1 + c_2\mathfrak{a}_2 + \cdots + c_h\mathfrak{a}_h = \mathfrak{0}$$

except the trivial one with coefficients

$$c_1 = 0, \quad c_2 = 0, \quad \cdots, \quad c_h = 0.$$

$h$ such vectors are said to span an $h$-dimensional (linear) sub-space
$\mathfrak{R}'$ consisting of all vectors of the form

$$\mathfrak{x} = \xi_1\mathfrak{a}_1 + \xi_2\mathfrak{a}_2 + \cdots + \xi_h\mathfrak{a}_h \tag{1.1}$$

where the $\xi$'s are arbitrary numbers. It follows from the
fundamental theorem on homogeneous linear equations that
there exists a non-trivial homogeneous relation between any
$h + 1$ vectors of $\mathfrak{R}'$. The dimensionality $h$ of $\mathfrak{R}'$ can therefore
be characterized independently of the basis; every $h + 1$ vectors
in $\mathfrak{R}'$ are linearly dependent, but there exist in it $h$ linearly
independent vectors. Any such system of $h$ independent
vectors $\mathfrak{a}_1, \mathfrak{a}_2, \cdots, \mathfrak{a}_h$ in $\mathfrak{R}'$ can be used as a co-ordinate system
or basis in $\mathfrak{R}'$; the coefficients $\xi_1, \xi_2, \cdots, \xi_h$ in the representation
(1.1) are then said to be the components of $\mathfrak{x}$ in the co-ordinate
system $(\mathfrak{a}_1, \mathfrak{a}_2, \cdots, \mathfrak{a}_h)$.
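Linear independence can be checked mechanically. The following is a minimal sketch, assuming vectors are given as tuples of integers or fractions; it counts independent vectors by Gaussian elimination over the rationals. The helper names `rank` and `linearly_independent` are illustrative and do not come from the text.

```python
from fractions import Fraction


def rank(vectors):
    """Number of linearly independent vectors among `vectors`,
    found by Gaussian elimination with exact rational arithmetic."""
    rows = [[Fraction(c) for c in v] for v in vectors]
    ncols = len(rows[0]) if rows else 0
    r = 0  # number of pivots found so far
    for col in range(ncols):
        # Look for a row at or below position r with a non-zero entry in this column.
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        # Clear the column below the pivot.
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [p - factor * q for p, q in zip(rows[i], rows[r])]
        r += 1
    return r


def linearly_independent(vectors):
    """True if only the trivial relation c_1 a_1 + ... + c_h a_h = 0 holds."""
    return rank(vectors) == len(vectors)


# Hypothetical example: a_1, a_2 span a 2-dimensional sub-space of 3-space.
a1 = (1, 0, 1)
a2 = (0, 1, 1)
independent_pair = linearly_independent([a1, a2])
# Any h + 1 = 3 vectors of the sub-space are dependent, e.g. adding 2*a1 + 3*a2.
dependent_triple = linearly_independent([a1, a2, (2, 3, 5)])
```

Exact `Fraction` arithmetic is used so that the test for a vanishing pivot is not disturbed by rounding.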

The entire space $\mathfrak{R}$ is $n$-dimensional, and the vectors

$$\mathfrak{e}_1 = (1, 0, 0, \cdots, 0), \quad \mathfrak{e}_2 = (0, 1, 0, \cdots, 0), \quad \cdots, \quad \mathfrak{e}_n = (0, 0, 0, \cdots, 1) \tag{1.2}$$

define a co-ordinate system in it in which the components of a
vector

$$\mathfrak{x} = (x_1, x_2, \cdots, x_n)$$

agree with the “absolute components” $x_i$:

$$\mathfrak{x} = x_1\mathfrak{e}_1 + x_2\mathfrak{e}_2 + \cdots + x_n\mathfrak{e}_n.$$

From the standpoint of affine geometry, however, the “absolute
co-ordinate system” (1.2) has no preference over any other which
consists of $n$ independent vectors of $\mathfrak{R}$. We now add to the
previous axioms, which did not concern themselves with the
dimensionality $n$, the following dimensionality axiom:

(γ) The maximum number of linearly independent vectors in $\mathfrak{R}$
is $n$.

These axioms (α), (β), and (γ) suffice for a complete formulation
of vector calculus, for if $\mathfrak{e}_1, \mathfrak{e}_2, \cdots, \mathfrak{e}_n$ are any $n$ independent
vectors and $\mathfrak{x}$ is any other vector there must necessarily exist
a linear dependence

$$a\mathfrak{x} + a_1\mathfrak{e}_1 + a_2\mathfrak{e}_2 + \cdots + a_n\mathfrak{e}_n = \mathfrak{0}$$

between them. Since not all the coefficients may vanish, and since
the $\mathfrak{e}_i$ by themselves are independent, we must in particular have
$a \neq 0$, and consequently any vector $\mathfrak{x}$ can be expressed as a
linear combination

$$\mathfrak{x} = x_1\mathfrak{e}_1 + x_2\mathfrak{e}_2 + \cdots + x_n\mathfrak{e}_n \tag{1.3}$$

of the “fundamental vectors” $\mathfrak{e}_1, \mathfrak{e}_2, \cdots, \mathfrak{e}_n$. We specify $\mathfrak{x}$ by
the set $(x_1, x_2, \cdots, x_n)$ of components in this co-ordinate system.
In accordance with axioms (α) and (β) for addition and multiplication
we then have for any two vectors $\mathfrak{x}$ (1.3) and $\mathfrak{y}$

$$a\mathfrak{x} = (ax_1)\mathfrak{e}_1 + \cdots + (ax_n)\mathfrak{e}_n, \quad \mathfrak{x} + \mathfrak{y} = (x_1 + y_1)\mathfrak{e}_1 + \cdots + (x_n + y_n)\mathfrak{e}_n,$$

and we arrive at the definitions from which we started. The 
only — but important — difference between the arithmetic and 
the axiomatic treatment is that in the former the absolute co- 
ordinate system (1.2) is given the preference over any other, 
whereas in the latter treatment no such distinction is made. 

Given any system of vectors, all vectors $\mathfrak{x}$ which are obtained,
as in (1.1), by linear combinations of a finite number of vectors
$\mathfrak{a}_1, \mathfrak{a}_2, \cdots, \mathfrak{a}_h$ of the system constitute a (linear) sub-space,
namely the sub-space “spanned” by the vectors $\mathfrak{a}$.

$\mathfrak{R}$ is said to be decomposed or reduced into two linear sub-spaces
$\mathfrak{R}'$, $\mathfrak{R}''$ ($\mathfrak{R} = \mathfrak{R}' + \mathfrak{R}''$) if an arbitrary vector $\mathfrak{x}$ can be
expressed uniquely as the sum of a vector $\mathfrak{x}'$ of $\mathfrak{R}'$ and a vector
$\mathfrak{x}''$ of $\mathfrak{R}''$. A co-ordinate system in $\mathfrak{R}'$ and a co-ordinate system
in $\mathfrak{R}''$ constitute together a co-ordinate system for the entire
space $\mathfrak{R}$; this co-ordinate system in $\mathfrak{R}$ is “adapted” to the
decomposition $\mathfrak{R}' + \mathfrak{R}''$. The sum $n' + n''$ of the dimensionalities
of $\mathfrak{R}'$ and $\mathfrak{R}''$ is equal to $n$, the dimensionality of $\mathfrak{R}$.
Conversely, if the sub-spaces $\mathfrak{R}'$, $\mathfrak{R}''$ have no vector except $\mathfrak{0}$
in common, and if the sum of their dimensionalities is $n$, then
$\mathfrak{R} = \mathfrak{R}' + \mathfrak{R}''$.

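$\mathfrak{R}'$ being an $n'$-dimensional sub-space, two vectors $\mathfrak{x}$ and $\mathfrak{y}$ are
said to be congruent modulo $\mathfrak{R}'$:

$$\mathfrak{x} \equiv \mathfrak{y} \pmod{\mathfrak{R}'},$$

if their difference lies in $\mathfrak{R}'$. Congruence satisfies the axioms
postulated of any relation of equality: every vector is congruent
to itself; if $\mathfrak{x} \equiv \mathfrak{y}$ (mod. $\mathfrak{R}'$) then $\mathfrak{y} \equiv \mathfrak{x}$ (mod. $\mathfrak{R}'$); if
$\mathfrak{x} \equiv \mathfrak{y}$ (mod. $\mathfrak{R}'$) and $\mathfrak{y} \equiv \mathfrak{z}$ (mod. $\mathfrak{R}'$), then $\mathfrak{x} \equiv \mathfrak{z}$ (mod. $\mathfrak{R}'$). It is
therefore permissible to consider vectors which are congruent
mod. $\mathfrak{R}'$ as differing in no wise from one another; by this
abstraction, which we call projection with respect to $\mathfrak{R}'$, the
$n$-dimensional space $\mathfrak{R}$ gives rise to an $(n - n')$-dimensional
space $\bar{\mathfrak{R}}$. $\bar{\mathfrak{R}}$ is also a vector space, for from

$$\mathfrak{x}_1 \equiv \mathfrak{x}_2, \quad \mathfrak{y}_1 \equiv \mathfrak{y}_2 \pmod{\mathfrak{R}'}$$

follow the relations

$$a\mathfrak{x}_1 \equiv a\mathfrak{x}_2, \quad \mathfrak{x}_1 + \mathfrak{y}_1 \equiv \mathfrak{x}_2 + \mathfrak{y}_2 \pmod{\mathfrak{R}'}.$$

The operations of multiplication by a number and addition can
therefore be considered as ones which operate directly on the
vectors $\bar{\mathfrak{x}}$ of $\bar{\mathfrak{R}}$. All vectors $\mathfrak{x}$ of $\mathfrak{R}$ which are congruent mod. $\mathfrak{R}'$
give rise to the same vector $\bar{\mathfrak{x}}$ of $\bar{\mathfrak{R}}$. If $\mathfrak{R}'$ is one-dimensional
and is spanned by $\mathfrak{e}$ the above process is the familiar one of
parallel projection in the direction of $\mathfrak{e}$; it is not necessary to
give an $(n - 1)$-dimensional sub-space of $\mathfrak{R}$ on to which the
projection is made.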
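For the one-dimensional case just described, congruence modulo the sub-space spanned by a single vector can be tested directly, without choosing any sub-space on to which to project. The sketch below is an illustration under that assumption; the names `is_multiple` and `congruent` are hypothetical.

```python
def is_multiple(d, e):
    """True if d = t * e for some number t; e is assumed to be non-null.
    d is a multiple of e exactly when all 2x2 minors of the pair vanish."""
    n = len(d)
    return all(d[i] * e[j] - d[j] * e[i] == 0
               for i in range(n) for j in range(n))


def congruent(x, y, e):
    """True if x and y are congruent modulo the sub-space spanned by e,
    i.e. if their difference x - y lies in that sub-space."""
    difference = tuple(xi - yi for xi, yi in zip(x, y))
    return is_multiple(difference, e)


# Hypothetical example in 3-space with the direction e = (1, 1, 0).
e = (1, 1, 0)
x1, x2 = (3, 1, 2), (5, 3, 2)   # differ by -2 * e
y1, y2 = (0, 0, 1), (1, 1, 1)   # differ by -1 * e
same_class = congruent(x1, x2, e)
# Congruence is compatible with addition: x1 + y1 is congruent to x2 + y2.
sum1 = tuple(p + q for p, q in zip(x1, y1))
sum2 = tuple(p + q for p, q in zip(x2, y2))
sums_congruent = congruent(sum1, sum2, e)
```

The check on the sums illustrates why addition may be regarded as operating directly on the congruence classes.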

If $\mathfrak{a}$ is a non-null vector, all vectors $\mathfrak{x}$ which arise by
multiplying $\mathfrak{a}$ by a number are said to lie on the same ray as $\mathfrak{a}$. Two
non-null vectors determine the same ray when, and only when,
one is a multiple of the other. In a given co-ordinate system
the vector $\mathfrak{a}$ is characterized by its components $a_1, a_2, \cdots, a_n$
whereas the ray $\mathfrak{a}$ is characterized by their ratios $a_1 : a_2 : \cdots : a_n$;
these ratios have meaning only when the components of $\mathfrak{a}$ do
not all vanish, i.e. only when $\mathfrak{a} \neq \mathfrak{0}$.

The transition from one co-ordinate system $\mathfrak{e}_i$ to another $\mathfrak{e}_i'$ is
accomplished by expressing the new co-ordinate vectors $\mathfrak{e}_k'$ in
terms of the old:

$$\mathfrak{e}_k' = \sum_{i=1}^{n} a_{ik}\mathfrak{e}_i.$$

If $x_i$, $x_i'$ are the components of an arbitrary vector $\mathfrak{x}$ in the old
and in the new co-ordinate systems, respectively, then

$$\mathfrak{x} = \sum_{i} x_i\mathfrak{e}_i = \sum_{k} x_k'\mathfrak{e}_k',$$

from which the law of transformation

$$x_i = \sum_{k=1}^{n} a_{ik}x_k' \tag{1.4}$$

follows. The requirement that the co-ordinate vectors $\mathfrak{e}_k'$ also
be linearly independent is expressed arithmetically by the
non-vanishing of the determinant of the coefficients $a_{ik}$. The
components of vectors $\mathfrak{x}$, $\mathfrak{y}$, $\cdots$ in $\mathfrak{R}$ undergo the same transformation
on transition to the new co-ordinate system $\mathfrak{e}_i'$ and are said to
transform cogrediently.
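The law of transformation (1.4) can be illustrated numerically. In the sketch below, which rests on an invented 2-dimensional example, the entry `a[i][k]` is taken to be the i-th old component of the k-th new co-ordinate vector; the names `old_components` and `det` are assumptions made for this example.

```python
def old_components(a, x_new):
    """Old components x_i = sum_k a[i][k] * x_new[k], as in (1.4)."""
    n = len(a)
    return [sum(a[i][k] * x_new[k] for k in range(n)) for i in range(n)]


def det(m):
    """Determinant by expansion along the first row (adequate for small n)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for k in range(len(m)):
        minor = [row[:k] + row[k + 1:] for row in m[1:]]
        total += (-1) ** k * m[0][k] * det(minor)
    return total


# Hypothetical new co-ordinate vectors e_1' = (1, 1) and e_2' = (1, -1),
# stored as the columns of a.
a = [[1, 1],
     [1, -1]]
x_new = [2, 3]                      # components in the new system
x_old = old_components(a, x_new)    # components in the old system
# Direct check: 2 * e_1' + 3 * e_2' written out in the old system.
direct = [2 * 1 + 3 * 1, 2 * 1 + 3 * (-1)]
new_vectors_independent = det(a) != 0
```

Every vector is converted with the same coefficients `a[i][k]`, which is what the text means by cogredient transformation.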

§ 2. Linear Correspondences. Matrix Calculus 

The formula (1.4) can, however, be otherwise interpreted; 
it is the expression of a linear or affine correspondence or 
mapping of the space 91 on itself. But for this purpose it 
will be found more convenient to interchange the roles of the 
accented and the unaccented co-ordinates. On employing a
definite co-ordinate system $\mathfrak{e}_i$, the equation

$$x_i' = \sum_{k=1}^{n} a_{ik}x_k \tag{2.1}$$

associates with an arbitrary vector $\mathfrak{x}$ with components $x_i$ a vector
$\mathfrak{x}'$ with components $x_i'$. This correspondence $\mathfrak{x} \to \mathfrak{x}'$ of $\mathfrak{R}$ on
itself can be characterized as linear by the two assertions: if
$\mathfrak{x}$, $\mathfrak{y}$ go over into $\mathfrak{x}'$, $\mathfrak{y}'$, then $a\mathfrak{x}$ goes over into $a\mathfrak{x}'$ and $\mathfrak{x} + \mathfrak{y}$ into
$\mathfrak{x}' + \mathfrak{y}'$. Linear correspondences therefore leave all affine
relations unaltered; hence their prominence in the theory of affine
geometry. In order to show that these two conditions fully
determine the linear correspondence (2.1), consider the following:
if a correspondence $A$ which satisfies these conditions sends the
fundamental vector $\mathfrak{e}_k$ over into

$$\mathfrak{e}_k' = \sum_{i} a_{ik}\mathfrak{e}_i \tag{2.2}$$

then, in consequence of the above requirements,

$$\mathfrak{x} = x_1\mathfrak{e}_1 + \cdots + x_n\mathfrak{e}_n$$

goes over into

$$\mathfrak{x}' = x_1\mathfrak{e}_1' + \cdots + x_n\mathfrak{e}_n'.$$

On substituting (2.2) in this equation we see that the new vector
$\mathfrak{x}'$ has in the co-ordinate system $\mathfrak{e}_i$ the components $x_i'$ obtained
from the components $x_i$ of $\mathfrak{x}$ by means of (2.1). It has become
customary in quantum physics to call the linear correspondences
of a vector space $\mathfrak{R}$ operators which operate on the arbitrary
vector $\mathfrak{x}$ of $\mathfrak{R}$.
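The statement that a linear correspondence is fixed by the images of the fundamental vectors can be made concrete. The following sketch assumes a matrix stored as a list of rows; the function name `apply` and the sample numbers are illustrative only.

```python
def apply(a, x):
    """Components x_i' = sum_k a[i][k] * x[k] of the image vector, as in (2.1)."""
    return [sum(a_ik * x_k for a_ik, x_k in zip(row, x)) for row in a]


# Hypothetical coefficients a_ik of a correspondence A in two dimensions.
a = [[1, 2],
     [3, 4]]

# The images of the fundamental vectors are the columns of the matrix, as in (2.2).
image_e1 = apply(a, [1, 0])
image_e2 = apply(a, [0, 1])

# For x = 5 e_1 + 7 e_2, linearity gives x' = 5 e_1' + 7 e_2'.
x = [5, 7]
by_formula = apply(a, x)
by_linearity = [5 * p + 7 * q for p, q in zip(image_e1, image_e2)]
```

Both routes give the same components, which is the content of the substitution of (2.2) described in the text.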

Let $A$, $B$ be two linear correspondences, the first of which
sends the arbitrary vector $\mathfrak{x}$ over into $\mathfrak{x}' = A\mathfrak{x}$, while the second
sends $\mathfrak{x}'$ into $\mathfrak{x}'' = B\mathfrak{x}' = B(A\mathfrak{x})$. The resultant correspondence
$C$, which carries $\mathfrak{x}$ directly into $\mathfrak{x}''$, is also linear and is denoted
by $(BA)$ (to be read from right to left!):

$$(BA)\mathfrak{x} = B(A\mathfrak{x}).$$

This “multiplication” satisfies laws which are similar to those
of multiplication of ordinary numbers; in particular, the
associative law

$$C(BA) = (CB)A$$

is here valid, but the commutative law is not: in general
$AB \neq BA$. The “1” in this domain, which we here denote by
$\mathbf{1}$, is the identity, i.e. that correspondence which associates every
vector $\mathfrak{x}$ with itself: $\mathfrak{x} \to \mathfrak{x}$. Hence

$$A\mathbf{1} = \mathbf{1}A = A.$$

The correspondence $A$ is reversible if and only if
it is non-degenerate, i.e. if it carries no non-vanishing vector into
the vector $\mathfrak{0}$, or if distinct vectors are always carried over into
distinct ones. The algebraic condition for this is the
non-vanishing of the determinant $|a_{ik}| = \det A$; there then exists
the inverse correspondence $A^{-1}$:

$$AA^{-1} = A^{-1}A = \mathbf{1}.$$

The multiplication theorem for determinants states that

$$\det(BA) = \det B \cdot \det A.$$
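A small numerical sketch may help fix the right-to-left reading of the product and the failure of the commutative law. The matrices below are invented for illustration, and the helper names `matmul`, `apply` and `det` are assumptions rather than anything taken from the text.

```python
def matmul(b, a):
    """Matrix of the composite BA: the correspondence A is applied first, then B."""
    n = len(a)
    return [[sum(b[i][r] * a[r][k] for r in range(n)) for k in range(len(a[0]))]
            for i in range(len(b))]


def apply(a, x):
    """Image of the vector x under the correspondence with matrix a."""
    return [sum(a_ik * x_k for a_ik, x_k in zip(row, x)) for row in a]


def det(m):
    """Determinant by expansion along the first row (adequate for small n)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for k in range(len(m)):
        minor = [row[:k] + row[k + 1:] for row in m[1:]]
        total += (-1) ** k * m[0][k] * det(minor)
    return total


# Hypothetical sample correspondences in two dimensions.
A = [[2, 1], [0, 1]]
B = [[1, 0], [1, 3]]
BA = matmul(B, A)
AB = matmul(A, B)
commute = (AB == BA)                               # False for this pair
x = [1, 1]
same_image = apply(BA, x) == apply(B, apply(A, x)) # (BA)x = B(Ax)
dets_multiply = det(BA) == det(B) * det(A)
```

For this particular pair the two products differ, while the determinant of either product equals the product of the determinants.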

Not only can we “multiply” two correspondences, we can
also “add” them. This concept of addition arises quite naturally:
if the arbitrary vector $\mathfrak{x}$ is sent over into $\mathfrak{x}_1'$ by $A$ and into
$\mathfrak{x}_2'$ by $B$, then that correspondence which sends $\mathfrak{x}$ into $\mathfrak{x}_1' + \mathfrak{x}_2'$ is
also linear and is denoted by $A + B$:

$$(A + B)\mathfrak{x} = A\mathfrak{x} + B\mathfrak{x}.$$

We may also introduce multiplication by an arbitrary number
$a$: $aA$ is that correspondence which sends $\mathfrak{x}$ into $a(A\mathfrak{x})$. Addition
and multiplication by a number obey the same laws as the
analogous operations on vectors. Addition is commutative,
and has as its inverse subtraction. The role of 0 is played by
the correspondence $0$ which transforms every vector $\mathfrak{x}$ into the
vector $\mathfrak{0}$. Addition obeys the distributive law with respect to
multiplication:

$$(A + B)C = AC + BC, \quad C(A + B) = CA + CB,$$

$$(aA)C = a(AC), \quad C(aA) = a(CA).$$

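Before proceeding to the arithmetical expression of these
operations in a given co-ordinate system, we consider another
natural generalization. We can map an $m$-dimensional vector
space $\mathfrak{R}$ linearly on an $n$-dimensional space $\mathfrak{S}$; this is
accomplished when with each vector $\mathfrak{x}$ of $\mathfrak{R}$ a vector $\mathfrak{y}$ of $\mathfrak{S}$ is associated
in such a way $\mathfrak{x} \to \mathfrak{y}$ that from $\mathfrak{x}_1 \to \mathfrak{y}_1$, $\mathfrak{x}_2 \to \mathfrak{y}_2$ it follows that

$$a\mathfrak{x}_1 \to a\mathfrak{y}_1, \quad \mathfrak{x}_1 + \mathfrak{x}_2 \to \mathfrak{y}_1 + \mathfrak{y}_2.$$

Such a correspondence $A$: $\mathfrak{x} \to \mathfrak{y}$ is expressed by equations of
the form

$$y_k = \sum_{i=1}^{m} a_{ki}x_i \quad (k = 1, 2, \cdots, n) \tag{2.3}$$

where $x_1, \cdots, x_m$ are the components of $\mathfrak{x}$ in a given co-ordinate
system in the space $\mathfrak{R}$ and $y_1, \cdots, y_n$ have the corresponding
interpretation in $\mathfrak{S}$. With this correspondence $A$ there is
associated the matrix

$$\begin{Vmatrix} a_{11} & a_{12} & \cdots & a_{1m} \\ a_{21} & a_{22} & \cdots & a_{2m} \\ \vdots & \vdots & & \vdots \\ a_{n1} & a_{n2} & \cdots & a_{nm} \end{Vmatrix}$$

with $n$ rows and $m$ columns, and which we also denote by
the same letter $A$. The first index indicates the row and the
second the column to which $a_{ki}$ belongs. We can also add
correspondences of the same space $\mathfrak{R}$ on the same space $\mathfrak{S}$. Addition
and multiplication by a number is accomplished on matrices by
subjecting their $n \cdot m$ components to these operations: if

$$A = \| a_{ki} \| \quad \text{and} \quad B = \| b_{ki} \|$$

then

$$aA = \| a \cdot a_{ki} \|, \quad A + B = \| a_{ki} + b_{ki} \|.$$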

If we have a third ($p$-dimensional) vector space $\mathfrak{T}$, the
consecutive application of the correspondences
$A : \mathfrak{x} \to \mathfrak{y}$ of $\mathfrak{R}$ on $\mathfrak{S}$ and
$B : \mathfrak{y} \to \mathfrak{z}$ of $\mathfrak{S}$ on $\mathfrak{T}$
gives rise to the correspondence $C = BA : \mathfrak{x} \to \mathfrak{z}$
of $\mathfrak{R}$ on $\mathfrak{T}$. This composition is expressed in
terms of matrix components by the law

\[ c_{li} = \sum_{k=1}^{n} b_{lk} a_{ki} \qquad \begin{pmatrix} l = 1, 2, \ldots, p \\ i = 1, 2, \ldots, m \end{pmatrix}. \tag{2.4} \]

$B$ has $p$ rows and $n$ columns and $A$ $n$ rows and $m$ columns; the
composition of matrices is possible when the first factor $B$ has
the same number of columns as the second factor $A$ has rows.
The component or element $c_{li}$, which is found at the intersection
of the $l$-th row and the $i$-th column, is formed in accordance with
(2.4) from the components in the $l$-th row of $B$ and the $i$-th column
of $A$. An important special case is that in which $\mathfrak{T}$ is the
same space as $\mathfrak{R}$; $A$ is then a correspondence of
$\mathfrak{R}$ on $\mathfrak{S}$, $B$ of $\mathfrak{S}$ on $\mathfrak{R}$.
Already here concepts of the theory of groups play an important
role; on beginning Chapter III, which deals with the theory of
groups, the reader should return to the matter here discussed
as an illustration.
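
The rule (2.4) can be tried out numerically. The following is a minimal sketch in Python; the function names `compose` and `apply_matrix` and the numerical entries are chosen here purely for illustration and do not come from the text. It checks that applying $C = BA$ to a column gives the same result as applying $A$ first and $B$ afterwards.

```python
def compose(B, A):
    """Return C = BA with c[l][i] = sum over k of b[l][k] * a[k][i], as in (2.4)."""
    p, n = len(B), len(B[0])
    if len(A) != n:
        raise ValueError("B must have as many columns as A has rows")
    m = len(A[0])
    return [[sum(B[l][k] * A[k][i] for k in range(n)) for i in range(m)]
            for l in range(p)]


def apply_matrix(M, x):
    """Apply the matrix M to the column of components x."""
    return [sum(row[i] * x[i] for i in range(len(x))) for row in M]


# Illustrative data: A has n = 2 rows and m = 3 columns,
# B has p = 4 rows and n = 2 columns, so C = BA has 4 rows and 3 columns.
A = [[1, 2, 0],
     [0, 1, 3]]
B = [[1, 1],
     [2, 0],
     [0, 1],
     [1, -1]]
C = compose(B, A)

x = [1, -1, 2]
print(apply_matrix(C, x) == apply_matrix(B, apply_matrix(A, x)))
```

Reversing the factors, `compose(A, B)`, raises an error in this example, since $A$ has three columns while $B$ has four rows.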

The matrix calculus allows us to express the formulae for
a linear correspondence, such as (2.3), in an abbreviated form.
We do this by denoting by $x$ that matrix whose only column
consists of the vector components $x_1, x_2, \ldots, x_m$; similarly
for $y$. In accordance with the rule (2.4) for the composition of
matrices, equations (2.3) can be written

\[ y = Ax. \tag{2.5} \]

This form is particularly useful in examining the effect on the
matrix $A$ of a linear correspondence of a space $\mathfrak{R}$ on a
space $\mathfrak{S}$ when the original co-ordinate systems are replaced
by new ones. If this change of co-ordinates is effected by the
transformations

\[ x_i = \sum_k s_{ik} x_k' \quad \text{or} \quad x = Sx' \quad \text{in } \mathfrak{R}, \]

\[ y_k = \sum_h t_{kh} y_h' \quad \text{or} \quad y = Ty' \quad \text{in } \mathfrak{S}, \]

then from (2.5)

\[ Ty' = ASx' \quad \text{or} \quad y' = (T^{-1}AS)x'. \]

The same correspondence in the new co-ordinates is therefore
expressed by the matrix

\[ A' = T^{-1}AS. \tag{2.6} \]
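
A small numerical check of (2.6) may help fix the roles of $S$ and $T$. The sketch below is only an illustration under assumed data: the matrices $A$, $S$, $T$ and the helper names are hypothetical, and $T$ is chosen with determinant 1 so that its inverse can be written down by hand in whole numbers.

```python
def matmul(P, Q):
    """Matrix product PQ by the row-into-column rule (2.4)."""
    return [[sum(P[r][k] * Q[k][c] for k in range(len(Q)))
             for c in range(len(Q[0]))] for r in range(len(P))]


def apply_matrix(M, x):
    """Apply the matrix M to the column of components x."""
    return [sum(row[i] * x[i] for i in range(len(x))) for row in M]


A = [[1, 2, 0],          # correspondence of a 3-dimensional space
     [0, 1, 3]]          # on a 2-dimensional one
S = [[1, 1, 0],          # x = S x'  (new co-ordinates in the first space)
     [0, 1, 1],
     [0, 0, 1]]
T = [[2, 1],             # y = T y'  (new co-ordinates in the second space)
     [1, 1]]
T_inv = [[1, -1],        # inverse of T, written out by hand (det T = 1)
         [-1, 2]]

A_new = matmul(T_inv, matmul(A, S))      # A' = T^{-1} A S

x_new = [3, -2, 5]                       # components x' in the new system
x_old = apply_matrix(S, x_new)           # the same vector in the old system
y_old = apply_matrix(A, x_old)           # its image, old co-ordinates
y_new = apply_matrix(A_new, x_new)       # its image, new co-ordinates

print(apply_matrix(T, y_new) == y_old)   # y = T y' must hold for the image
```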

Let us now return to the linear correspondence $A$ of a space
$\mathfrak{R}$ on to itself. If $\mathfrak{R}'$ is a linear
$n'$-dimensional sub-space of $\mathfrak{R}$, we say that $A$ leaves
$\mathfrak{R}'$ invariant if it carries any vector of $\mathfrak{R}'$
over into a vector of $\mathfrak{R}'$. If the co-ordinate system is so
chosen that the first $n'$ fundamental vectors lie in $\mathfrak{R}'$,
the matrix of a correspondence which leaves $\mathfrak{R}'$ invariant
will assume the form given by Fig. 1. All elements in the rectangle
of $n'$ columns and $n - n'$ rows, denoted by zeros in Fig. 1, vanish.
$A$ contains a correspondence of $\mathfrak{R}'$ on to itself and at
the same time a correspondence of the space $\bar{\mathfrak{R}}$,
arising by projecting $\mathfrak{R}$ with respect to $\mathfrak{R}'$,
on to itself. The matrices of these correspondences consist in the
shaded squares. If $\mathfrak{R}$ is decomposed into
$\mathfrak{R}_1 + \mathfrak{R}_2$ $(n_1 + n_2 = n)$, and if the
correspondence $A$ leaves both sub-spaces $\mathfrak{R}_1$ and
$\mathfrak{R}_2$ invariant, then $A$ is completely reduced into a
correspondence of $\mathfrak{R}_1$ on to itself and a correspondence
of $\mathfrak{R}_2$ on to itself. If the co-ordinate system is adapted
to the decomposition $\mathfrak{R}_1 + \mathfrak{R}_2$, the matrix $A$
is completely reduced into two square matrices arranged along the
principal diagonal as in Fig. 2. The unshaded rectangles are empty;
the elements situated in these portions are all zero.

Let the $n$-dimensional linear space $\mathfrak{R}$ be decomposed into
sub-spaces $\mathfrak{R}_1 + \mathfrak{R}_2 + \cdots$, $\mathfrak{R}_\alpha$
having the dimensionality $n_\alpha$; $n$ is then equal to the sum
$n_1 + n_2 + \cdots$. Any vector $\mathfrak{x}$ can then be written
uniquely as the sum of components
$\mathfrak{x}_1 + \mathfrak{x}_2 + \cdots$ which lie in the sub-spaces
$\mathfrak{R}_1, \mathfrak{R}_2, \ldots$. The association
$\mathfrak{x} \to \mathfrak{x}_\alpha$ is a linear correspondence
$E_\alpha$ of $\mathfrak{R}$ on to $\mathfrak{R}_\alpha$. Given a
correspondence $A : \mathfrak{x} \to \mathfrak{x}'$ of $\mathfrak{R}$ on
to itself, we consider that linear correspondence $[A]_{\alpha\beta}$
which carries an arbitrary vector $\mathfrak{x}$ of $\mathfrak{R}_\beta$
over into the component $\mathfrak{x}_\alpha'$ in $\mathfrak{R}_\alpha$
of $\mathfrak{x}'$. We call $[A]_{\alpha\beta}$ the portion of $A$ in
which $\mathfrak{R}_\alpha$ intersects $\mathfrak{R}_\beta$. This
terminology arises from the matrix representation of $A$; on adapting
the co-ordinate system to the decomposition
$\mathfrak{R}_1 + \mathfrak{R}_2 + \cdots$ the set of variables $x_i$,
or rather their indices $i$ which number the rows and columns of
the matrix, is broken up into segments of lengths $n_\alpha$
$(\alpha = 1, 2, \ldots)$. The matrix $A$ is thereby divided into the
single rectangles $[A]_{\alpha\beta}$ in which the $\alpha$-th set of
rows intersects the $\beta$-th set of columns, and which consist of
$n_\alpha \cdot n_\beta$ elements.

If $A$ is the matrix of a correspondence of $\mathfrak{R}$ on to itself
in a given co-ordinate system, and $A'$ its matrix in a co-ordinate
system obtained from the first by means of the reversible
transformation $S$, then in accordance with (2.6)

\[ A' = S^{-1}AS. \tag{2.7} \]

The search for an invariantive characterization of correspondences
may be formulated algebraically: to find expressions which
are so formed from the components of an arbitrary matrix that
they assume the same value for equivalent matrices, i.e. for
matrices $A$, $A'$ between which a relation (2.7) exists. The way
in which this can be accomplished is indicated by the related
problem of finding a vector $\mathfrak{x} \neq 0$ which is transformed
into a multiple $\lambda\mathfrak{x}$ of itself under the influence of
$A$. The column $x$ of the components of $\mathfrak{x}$ must then
satisfy the equation

\[ \lambda x = Ax, \quad \text{or} \quad (\lambda 1 - A)x = 0. \]

But $n$ linear homogeneous equations in $n$ unknowns have a
non-vanishing solution only if their determinant vanishes; the
multiplier $\lambda$ is therefore necessarily a root of the
“characteristic polynomial”

\[ f(\lambda) = \det(\lambda 1 - A) \tag{2.8} \]

of $A$. This polynomial is an invariant in the above sense, for
from (2.7) or $SA' = AS$ it follows that

\[ S(\lambda 1 - A') = (\lambda 1 - A)S, \]

whence by the theorem concerning the multiplication of
determinants

\[ \det S \cdot \det(\lambda 1 - A') = \det(\lambda 1 - A) \cdot \det S. \]

Since the determinant of the reversible transformation $S$ cannot
vanish, we can divide by it and obtain the required identity

\[ |\lambda 1 - A'| = |\lambda 1 - A|. \]

The characteristic polynomial is of degree $n$ in $\lambda$:

\[ f(\lambda) = \lambda^n - s_1 \lambda^{n-1} + \cdots \pm s_n, \]

whose coefficients, certain integral functions of the elements
$a_{ik}$, are invariants of the correspondence $A$. The “norm” $s_n$
is merely the determinant of $A$. The first coefficient $s_1$, the
trace

\[ s_1 = a_{11} + a_{22} + \cdots + a_{nn} = \operatorname{tr} A, \tag{2.9} \]

is of more importance, as it depends linearly on the $a_{ik}$:

\[ \operatorname{tr}(A_1 + A_2) = \operatorname{tr} A_1 + \operatorname{tr} A_2. \]
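
For $n = 2$ the characteristic polynomial is $\lambda^2 - s_1\lambda + s_2$ with $s_1 = \operatorname{tr} A$ and $s_2 = \det A$, so its invariance under (2.7) can be checked by direct computation. The sketch below does this for one assumed example; the matrices and the helper names `matmul` and `char_poly_2x2` are illustrative only, and $S$ is chosen with determinant 1 so that its inverse has whole-number entries.

```python
def matmul(P, Q):
    """Matrix product PQ by the row-into-column rule."""
    return [[sum(P[r][k] * Q[k][c] for k in range(len(Q)))
             for c in range(len(Q[0]))] for r in range(len(P))]


def char_poly_2x2(M):
    """Coefficients (1, -s1, s2) of f(lambda) = lambda^2 - s1*lambda + s2."""
    s1 = M[0][0] + M[1][1]                        # the trace
    s2 = M[0][0] * M[1][1] - M[0][1] * M[1][0]    # the norm (determinant)
    return (1, -s1, s2)


A = [[2, 1],
     [0, 3]]
S = [[1, 1],
     [1, 2]]
S_inv = [[2, -1],        # inverse of S, written out by hand (det S = 1)
         [-1, 1]]

A_prime = matmul(S_inv, matmul(A, S))   # A' = S^{-1} A S, as in (2.7)

print(char_poly_2x2(A))
print(char_poly_2x2(A_prime))
```

The individual elements of $A'$ differ from those of $A$, while the two coefficient triples agree.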

If $A$ is a linear correspondence of the $m$-dimensional vector
space $\mathfrak{R}$ on the $n$-dimensional space $\mathfrak{S}$, and $B$
is conversely a linear correspondence of $\mathfrak{S}$ on
$\mathfrak{R}$, then we can build the correspondences $BA$ of
$\mathfrak{R}$ on to itself and $AB$ of $\mathfrak{S}$ on to itself.
These two correspondences have the same trace

\[ \operatorname{tr}(BA) = \operatorname{tr}(AB) \tag{2.10} \]

for, in accordance with the rule of composition (2.4) and the
definition (2.9) we have

\[ \operatorname{tr}(BA) = \sum_{i,k} b_{ik} a_{ki}, \qquad \operatorname{tr}(AB) = \sum_{i,k} a_{ki} b_{ik}, \]

where $i$ runs from 1 to $m$ and $k$ from 1 to $n$. The special case
in which $A$ and $B$ are both correspondences of $\mathfrak{R}$ on to
itself naturally deserves particular consideration.
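
Equation (2.10) holds even though $BA$ and $AB$ are square matrices of different sizes. A minimal sketch with assumed numbers ($m = 3$, $n = 2$; the entries and helper names are illustrative only):

```python
def matmul(P, Q):
    """Matrix product PQ by the row-into-column rule (2.4)."""
    return [[sum(P[r][k] * Q[k][c] for k in range(len(Q)))
             for c in range(len(Q[0]))] for r in range(len(P))]


def trace(M):
    """Sum of the diagonal elements of a square matrix, as in (2.9)."""
    return sum(M[i][i] for i in range(len(M)))


A = [[1, 2, 0],      # n = 2 rows, m = 3 columns
     [0, 1, 3]]
B = [[1, 0],         # m = 3 rows, n = 2 columns
     [2, 1],
     [0, -1]]

BA = matmul(B, A)    # 3 x 3 matrix
AB = matmul(A, B)    # 2 x 2 matrix

print(trace(BA), trace(AB))
```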

§ 3. The Dual Vector Space 

A function $L(\mathfrak{x})$ of the arbitrary vector $\mathfrak{x}$ of
the form

\[ \alpha_1 x_1 + \alpha_2 x_2 + \cdots + \alpha_n x_n \tag{3.1} \]

is called a linear form. This concept is invariant in the sense of
affine geometry: it can be defined by means of the functional
properties

\[ L(a\mathfrak{x}) = a \cdot L(\mathfrak{x}), \qquad L(\mathfrak{x} + \mathfrak{y}) = L(\mathfrak{x}) + L(\mathfrak{y}). \]

It is obvious that the expression (3.1) has these properties, and
conversely, on introducing a co-ordinate system $\mathfrak{e}_i$ and
setting $\mathfrak{x} = \sum_i x_i \mathfrak{e}_i$, it follows that

\[ L(\mathfrak{x}) = \sum_i \alpha_i x_i, \qquad \alpha_i = L(\mathfrak{e}_i). \]

On going over to another co-ordinate system such that the
components $x_i$ of an arbitrary vector $\mathfrak{x}$ undergo the
transformation (1.4), the linear form becomes

\[ \sum_i \alpha_i x_i = \sum_i \alpha_i' x_i', \]

the coefficients $\alpha_i'$ of which are related to the original
$\alpha_i$ by the equations

\[ \alpha_k' = \sum_i a_{ik} \alpha_i. \]

The coefficients $\alpha_i$ of a linear form are said to transform
contragrediently to the variables $x_i$.

It is, however, not necessary to consider the $\alpha_i$ as constants
and the $x_i$ as variables. When the $\alpha_i$ do not all vanish the
equation $L(\mathfrak{x}) = 0$ defines a “plane,” i.e. an
$(n - 1)$-dimensional sub-space; a vector $\mathfrak{x}$ lies in the
plane if its components satisfy this equation. But on the other hand
we can ask for the equation of all planes which pass through a given
non-vanishing vector; the $x_i = x_i^0$ are then constants and the
$\alpha_i$ variables. It is therefore most appropriate to consider the
two sets $(x_1, x_2, \ldots, x_n)$, $(\alpha_1, \alpha_2, \ldots, \alpha_n)$
in parallel.

We therefore introduce in addition to the space $\mathfrak{R}$ a second
$n$-dimensional vector space, the dual space $P$. From the
components $(\xi_1, \xi_2, \ldots, \xi_n)$ of a vector $\xi$ of $P$ and a
vector $(x_1, x_2, \ldots, x_n)$ of $\mathfrak{R}$ we can construct the
inner or scalar product

\[ \xi_1 x_1 + \xi_2 x_2 + \cdots + \xi_n x_n. \tag{3.2} \]

This product has, by definition, an invariantive significance, for
when $\mathfrak{R}$ is referred to a new co-ordinate system by means of
a transformation of the $x_i$ the variables $\xi_i$ of the dual space
$P$ undergo the contragredient transformation. This dual space is
in fact introduced in order to enable us to associate a
contragredient transformation with each one-to-one transformation.
To repeat, two linear reversible transformations

\[ x = Ax', \qquad \xi = \breve{A}\xi' \tag{3.3} \]

are contragredient with respect to each other if they leave (3.2)
unaltered:

\[ \xi_1 x_1 + \xi_2 x_2 + \cdots + \xi_n x_n = \xi_1' x_1' + \xi_2' x_2' + \cdots + \xi_n' x_n'. \tag{3.4} \]

A vector $\mathfrak{x}$ of $\mathfrak{R}$ and a vector $\xi$ of $P$ are
said to be in involution when their product (3.2) vanishes. A ray in
$\mathfrak{R}$ determines a plane in $P$, i.e. the plane consisting of
the vectors which are in involution with the given ray, and
conversely. Duality is a reciprocal relationship.†

The dual or transposed matrix $A^*$ of a matrix $A = \| a_{ki} \|$
is obtained by interchanging the rows and columns of $A$.
$A^* = \| a^*_{ik} \|$ is therefore defined by $a^*_{ik} = a_{ki}$, and
has $m$ rows and $n$ columns. We shall always employ the asterisk to
indicate this process. And what is its geometrical interpretation?
Let $\mathfrak{R}$ be an $m$-dimensional, $\mathfrak{S}$ an
$n$-dimensional, vector space; $A : \mathfrak{x} \to \mathfrak{y}$ a
linear correspondence of $\mathfrak{R}$ on $\mathfrak{S}$, specified in
terms of given co-ordinate systems in $\mathfrak{R}$ and $\mathfrak{S}$
by the matrix $A$:

\[ y_k = \sum_i a_{ki} x_i, \]

and let $P$, $\Sigma$ be the dual spaces. The product

\[ \sum_k \eta_k y_k = \sum_{k,i} a_{ki} \eta_k x_i \quad \Bigl( = \sum_i \xi_i x_i \Bigr), \]

where $\eta$ is an arbitrary vector of $\Sigma$ with components
$\eta_k$, has then an invariantive significance. A bilinear form which
depends linearly on a vector $\eta$ of $\Sigma$ and a vector
$\mathfrak{x}$ of $\mathfrak{R}$ is therefore invariantively associated
with a linear correspondence of $\mathfrak{R}$ on $\mathfrak{S}$, and
conversely. This gives rise, as the expression of the bilinear form
given in parentheses shows, to a correspondence

\[ \eta \to \xi : \quad \xi_i = \sum_k a_{ki} \eta_k \]

of $\Sigma$ on $P$, i.e. the dual $A^*$ of $A$. The reciprocal relation
existing between the correspondence $A$ and its dual $A^*$ may be
expressed as follows: if $\mathfrak{x}$ is an arbitrary vector in
$\mathfrak{R}$ and $\eta$ is an arbitrary vector in $\Sigma$, then the
product of the vectors $A\mathfrak{x}$ and $\eta$ is equal to the
product of $\mathfrak{x}$ and $A^*\eta$. The dual correspondences obey
the linear laws

\[ (A_1 + A_2)^* = A_1^* + A_2^*, \qquad (aA)^* = a \cdot A^*. \]

If $A$ is a correspondence of $\mathfrak{R}$ on $\mathfrak{S}$ and $B$ a
correspondence of $\mathfrak{S}$ on $\mathfrak{T}$, then

\[ (BA)^* = A^* B^*; \tag{3.5} \]

$BA$ maps $\mathfrak{R}$ linearly on $\mathfrak{T}$, and $A^*B^*$ maps
the dual space $T$ of $\mathfrak{T}$ on the dual $P$ of $\mathfrak{R}$.

† In the theory of relativity it is usual to call vectors in
$\mathfrak{R}$ and $P$ contravariant and covariant vectors, respectively.

We have agreed once and for all to consider the set
$x_1, x_2, \ldots, x_n$ of components of a vector $\mathfrak{x}$ as a
column; the inner product of the vector $\mathfrak{x}$ in
$\mathfrak{R}$ with the vector $\xi$ in $P$ can therefore be written in
matrix notation as $\xi^* x$ or $x^* \xi$. The transformations (3.3),
from the first of which it follows that $x^* = x'^* A^*$, are
consequently contragredient to one another if

\[ A^* \breve{A} = 1 \quad \text{or} \quad \breve{A} = (A^*)^{-1}, \tag{3.6} \]

and we have arrived at an explicit expression for the
contragredient transformation.
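
The explicit expression (3.6) can be checked on a small case. The following sketch, restricted to 2 by 2 matrices and using exact fractions, forms $\breve{A} = (A^*)^{-1}$ and verifies that the product (3.2) is unaltered, as (3.4) demands. The matrix $A$, the sample components, and all function names are assumptions made for this illustration.

```python
from fractions import Fraction


def transpose(M):
    """Interchange rows and columns (the asterisk operation)."""
    return [list(column) for column in zip(*M)]


def inverse_2x2(M):
    """Inverse of a reversible 2 x 2 matrix in exact arithmetic."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is not reversible")
    return [[d / det, -b / det], [-c / det, a / det]]


def apply_matrix(M, x):
    """Apply the matrix M to the column of components x."""
    return [sum(row[i] * x[i] for i in range(len(x))) for row in M]


def product(xi, x):
    """Inner product (3.2) of a dual vector xi with a vector x."""
    return sum(s * t for s, t in zip(xi, x))


A = [[1, 2],
     [0, 1]]
A_contra = inverse_2x2(transpose(A))     # contragredient matrix, by (3.6)

x_new, xi_new = [3, 4], [5, -1]          # components x', xi'
x_old = apply_matrix(A, x_new)           # x  = A x'
xi_old = apply_matrix(A_contra, xi_new)  # xi = (A*)^{-1} xi'

print(product(xi_old, x_old) == product(xi_new, x_new))
```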

Let $\mathfrak{R}'$ be an $n'$-dimensional sub-space of
$\mathfrak{R} = \mathfrak{R}_n$. All vectors of $P$ which are in
involution with the totality of vectors of $\mathfrak{R}'$ obviously
constitute, in consequence of the simplest theorems on linear
homogeneous equations, an $(n - n')$-dimensional sub-space $P'$ of $P$.
And from this we are led immediately to the result that if a
correspondence $A$ of $\mathfrak{R}$ on itself leaves the sub-space
$\mathfrak{R}'$ invariant, then the dual correspondence $A^*$ of $P$ on
itself leaves the associated sub-space $P'$ invariant.

Let $\mathfrak{R}$ be decomposed into two or more sub-spaces
$\mathfrak{R}_1 + \mathfrak{R}_2 + \cdots$ of dimensionalities
$n_1, n_2, \ldots$, and let the sub-space of $P$ which consists of all
vectors in involution with all vectors of
$\mathfrak{R}_2 + \mathfrak{R}_3 + \cdots$ be denoted by $P_1$, the
dimensionality of which is also $n_1$. Defining $P_2, P_3, \ldots$
analogously, we arrive at the decomposition $P = P_1 + P_2 + \cdots$,
for the sum of a vector of $P_1$, a vector of $P_2$, etc., can only be
zero when each of the individual summands vanishes. In order to prove
this latter statement, we note that if the sum is 0 then the first
summand belongs to $P_1$ as well as to $P_2 + P_3 + \cdots$, i.e. it is
in involution with all the vectors of
$\mathfrak{R}_2 + \mathfrak{R}_3 + \cdots$ as well as with all those of
$\mathfrak{R}_1$, and is therefore in involution with all the vectors
of $\mathfrak{R}$. But this is only possible if this first, and
therefore any, summand is zero. $P_1$ can be considered as the space
dual to $\mathfrak{R}_1$, for if $\mathfrak{x}$ is an arbitrary vector
in $\mathfrak{R}_1$ and $\eta$ a vector in $P$ with components
$\eta_1, \eta_2, \ldots$ in the various $P_\alpha$, then the product of
$\mathfrak{x}$ and $\eta$ is equal to the product of $\mathfrak{x}$ and
$\eta_1$.

If a correspondence $A$ of $\mathfrak{R}$ on itself leaves the
$n'$-dimensional sub-space $\mathfrak{R}'$ invariant, then the
$(n - n')$-dimensional sub-space $P'$ is invariant under the dual
correspondence $A^*$ of $P$ on itself. If $\mathfrak{R}$ is decomposed
into $\mathfrak{R}_1 + \mathfrak{R}_2 + \cdots$ and if $A$ leaves each
of the sub-spaces $\mathfrak{R}_\alpha$ invariant, then $A^*$ leaves
each of the sub-spaces $P_\alpha$ invariant. If $A$ is any
correspondence in $\mathfrak{R}$ and $[A]_{\alpha\beta}$ that portion
in which $\mathfrak{R}_\alpha$ intersects $\mathfrak{R}_\beta$, then the
portion $[A^*]_{\beta\alpha}$ of $A^*$ in which $P_\beta$ intersects
$P_\alpha$ is dual to $[A]_{\alpha\beta}$:

\[ [A^*]_{\beta\alpha} = [A]_{\alpha\beta}^*. \tag{3.7} \]

$[A]_{\alpha\beta}$ maps $\mathfrak{R}_\beta$ on $\mathfrak{R}_\alpha$
and $[A^*]_{\beta\alpha}$ maps the dual space $P_\alpha$ on $P_\beta$.

All these results are conceptually evident, but can be seen
even more readily directly from the matrices on adapting the
co-ordinate system to the decomposition
$\mathfrak{R}_1 + \mathfrak{R}_2 + \cdots$.

§4. Unitary Geometry and Hermitian Forms 

The metric is introduced into affine geometry by means of
a new fundamental concept: the absolute magnitude of a vector.
In Euclidean geometry the sum of the squares

\[ \mathfrak{x}^2 = x_1^2 + x_2^2 + \cdots + x_n^2 \tag{4.1} \]

of the components of a vector $\mathfrak{x} = (x_1, x_2, \ldots, x_n)$ is
taken as the square of its absolute value. The only co-ordinate
systems which are then equally permissible are the Cartesian systems,
in which the square of the absolute value of $\mathfrak{x}$ is given by
(4.1) in terms of the components $x_i$; the range of values which the
components may here assume is taken as the continuum of all
real numbers. But the content of the preceding paragraphs
is not bound to this choice; the only requirement is, in fact,
that the range of permissible values constitute a “field” in
which the four fundamental operations (excluding division by
zero) can be performed. We shall hereafter consider the
continuum of all complex numbers as the range of values which our
components may assume. The expression (4.1) loses its definite
character in this domain; the sum of the squares can vanish
without implying that each term is zero. It is therefore desirable
to replace the quadratic form (4.1) by the “unit Hermitian
form”

\[ \bar{x}_1 x_1 + \bar{x}_2 x_2 + \cdots + \bar{x}_n x_n \tag{4.2} \]

where $\bar{x}$ denotes the complex conjugate of a number $x$. The
value $\mathfrak{x}^2$ of (4.2) will be taken as the square of the
absolute magnitude of the vector
$\mathfrak{x} = (x_1, x_2, \ldots, x_n)$ and the corresponding bilinear
form

\[ (\mathfrak{x}\mathfrak{y}) = \bar{x}_1 y_1 + \bar{x}_2 y_2 + \cdots + \bar{x}_n y_n \]

as the scalar product $(\mathfrak{x}\mathfrak{y})$ of the two vectors
$\mathfrak{x}$ and $\mathfrak{y} = (y_1, y_2, \ldots, y_n)$. A
co-ordinate system is said to be normal when the square of the
absolute magnitude of a vector $\mathfrak{x}$ is expressed in terms of
its components $x_i$ in this co-ordinate system by (4.2). In a normal
co-ordinate system $\mathfrak{e}_i$ these components are the scalar
products

\[ x_i = (\mathfrak{e}_i \mathfrak{x}). \tag{4.3} \]

The transformations which lead from one normal co-ordinate
system to another such, which therefore leave the form (4.2)
invariant, are called unitary transformations.†

The conditions which characterize unitary transformations
are entirely analogous to those for orthogonal transformations,
with which we are familiar from the elements of analytic
geometry. Let $x = Sx'$ be such a transformation; under the
influence of $S$ the fundamental metric form (4.2) goes over into
$\bar{x}'^* \bar{S}^* S x'$. $S$ is therefore unitary if and only if
$\bar{S}^* S = 1$; the fact that $\det S \neq 0$ follows immediately
from this. Indeed, since a matrix $S$ and its transposed $S^*$ have the
same determinant, it follows that the determinant of a unitary
transformation has the absolute value 1: $|\det S|^2 = 1$. These
conditions may be expressed by the assertion that $\bar{S}^*$ is the
matrix $S^{-1}$ reciprocal to $S$, and therefore not only
$\bar{S}^* S = 1$ but also $S \bar{S}^* = 1$. The first of these
equations states that the sum of the squares of the absolute values of
the elements of a column is 1 and that the sum of the mixed products
$\sum_r \bar{s}_{ri} s_{rk}$ of two different columns $(i \neq k)$ is 0;
the second equation contains the same assertion for the elements of
the rows.
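
These column and row conditions are easy to test on a concrete matrix. The sketch below uses one assumed 2 by 2 example with complex entries and floating-point arithmetic, so equalities are checked only up to a small tolerance; the matrix and the helper names are illustrative choices, not taken from the text.

```python
import math


def conj_transpose(S):
    """Conjugate of the transposed matrix."""
    return [[S[r][c].conjugate() for r in range(len(S))]
            for c in range(len(S[0]))]


def matmul(P, Q):
    """Matrix product PQ by the row-into-column rule."""
    return [[sum(P[r][k] * Q[k][c] for k in range(len(Q)))
             for c in range(len(Q[0]))] for r in range(len(P))]


def is_identity(M, tol=1e-12):
    """True if M agrees with the unit matrix up to the tolerance tol."""
    n = len(M)
    return all(abs(M[r][c] - (1 if r == c else 0)) <= tol
               for r in range(n) for c in range(n))


s = 1 / math.sqrt(2)
S = [[s, s],
     [1j * s, -1j * s]]          # columns (1, i)/sqrt(2) and (1, -i)/sqrt(2)

S_dagger = conj_transpose(S)
det_S = S[0][0] * S[1][1] - S[0][1] * S[1][0]

print(is_identity(matmul(S_dagger, S)))   # column conditions
print(is_identity(matmul(S, S_dagger)))   # row conditions
print(abs(det_S))                         # modulus of the determinant
```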

We carry over the terminology usual in Euclidean geometry.
In particular, the vector $\mathfrak{y}$ is said to be perpendicular to
$\mathfrak{x}$ if the scalar product $(\mathfrak{x}\mathfrak{y})$
vanishes. In virtue of the symmetry law

\[ (\mathfrak{y}\mathfrak{x}) = \overline{(\mathfrak{x}\mathfrak{y})} \]

perpendicularity is a reciprocal relationship. There exists no
vector $\mathfrak{a}$, except $\mathfrak{a} = 0$, to which all vectors
are perpendicular; in fact, $\mathfrak{a} = 0$ is the only vector which
is perpendicular to itself. Normal co-ordinate systems can be
characterized by the fact that for them the scalar products of the
fundamental vectors $\mathfrak{e}_i$ among themselves are

\[ (\mathfrak{e}_i \mathfrak{e}_k) = \delta_{ik} = \begin{cases} 1 & (i = k), \\ 0 & (i \neq k). \end{cases} \]

† The name “orthogonal” has been used in the physical literature to
denote these transformations, but in mathematics it is necessary to
have different names for these two different concepts.

On comparing the fundamental metric form (4.2) with (3.2)
it is seen that the unitary space $\mathfrak{R}$ can be characterized
by the fact that its conjugate complex $\bar{\mathfrak{R}}$ coincides
with its dual $P$, or more precisely, that the conjugate complex
$\bar{\mathfrak{x}}$ of a vector $\mathfrak{x}$ can at the same time be
considered as its dual. We found that with a correspondence $A$ of an
$m$-dimensional unitary space $\mathfrak{R}$ on an $n$-dimensional
$\mathfrak{S}$ is associated in an invariant manner the correspondence
$A^*$ of the dual space $\Sigma$ on the dual $P$. As a consequence of
the equation $P = \bar{\mathfrak{R}}$ for unitary spaces

\[ \bar{A}^* = \tilde{A} \]

is a correspondence of $\mathfrak{S}$ on $\mathfrak{R}$; we call it the
“Hermitian conjugate of $A$.” $\tilde{A}A$ is a correspondence of
$\mathfrak{R}$ on itself, $A\tilde{A}$ of $\mathfrak{S}$ on itself. A
correspondence $S$ which carries the general vector $\mathfrak{x}$ over
into $\mathfrak{x}' = S\mathfrak{x}$ is unitary if it leaves the
absolute magnitude of $\mathfrak{x}$ unaltered:
$\mathfrak{x}'^2 = \mathfrak{x}^2$. Two configurations consisting of
vectors, either of which can be obtained from the other by a unitary
transformation, are congruent in unitary geometry; i.e. unitary
geometry is the theory of those relationships which are invariant
under an arbitrary unitary transformation. The characteristic
property of such transformations is expressed in terms of the matrix
calculus by either of the two equations

\[ \tilde{S}S = 1, \qquad S\tilde{S} = 1. \]

Let $\mathfrak{R}'$ be an $m$-dimensional linear sub-space spanned by
the linearly independent vectors
$\mathfrak{a}_1, \mathfrak{a}_2, \ldots, \mathfrak{a}_m$. We consider
a vector $\mathfrak{x}$ as belonging to the sub-space $\mathfrak{R}''$
if and only if it is perpendicular to $\mathfrak{R}'$, i.e. to all the
vectors of $\mathfrak{R}'$; such a vector must therefore satisfy the
equations

\[ (\mathfrak{a}_1 \mathfrak{x}) = 0, \quad (\mathfrak{a}_2 \mathfrak{x}) = 0, \quad \ldots, \quad (\mathfrak{a}_m \mathfrak{x}) = 0. \]

From these it follows that $\mathfrak{R}''$ is $(n - m)$-dimensional.
The relation between $\mathfrak{R}'$ and $\mathfrak{R}''$ is a
reciprocal one; every vector of $\mathfrak{R}''$ is perpendicular to
every vector of $\mathfrak{R}'$ and conversely. We then have
$\mathfrak{R} = \mathfrak{R}' + \mathfrak{R}''$, for if the sum
$\mathfrak{x}' + \mathfrak{x}''$ of a vector $\mathfrak{x}'$ in
$\mathfrak{R}'$ and a vector $\mathfrak{x}''$ in $\mathfrak{R}''$
vanishes then $\mathfrak{x}' = -\mathfrak{x}''$ is a vector which
belongs to both sub-spaces and is consequently perpendicular to
itself, and this can only occur if $\mathfrak{x}' = 0$. A unitary
correspondence which leaves $\mathfrak{R}'$ invariant will also leave
$\mathfrak{R}''$ invariant since the relation of perpendicularity will
not be destroyed by such a transformation. In dealing with unitary
correspondences or transformations it is therefore always possible
to find an invariant sub-space $\mathfrak{R}''$ associated with a given
invariant sub-space $\mathfrak{R}'$, such that
$\mathfrak{R} = \mathfrak{R}' + \mathfrak{R}''$. The previous remarks
about projection suggest that here in the unitary geometry we
identify the space generated by projecting $\mathfrak{R}$ with respect
to $\mathfrak{R}'$ with the sub-space $\mathfrak{R}''$: we project on
to the space perpendicular to $\mathfrak{R}'$. To this end we remark
that among all vectors $\mathfrak{a}$ in $\mathfrak{R}$ which are
congruent mod. $\mathfrak{R}'$ there is one, $(\mathfrak{a})$, which
lies in $\mathfrak{R}''$; we then have

\[ (a \cdot \mathfrak{a}) = a(\mathfrak{a}), \qquad (\mathfrak{a} + \mathfrak{b}) = (\mathfrak{a}) + (\mathfrak{b}). \]

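With an arbitrary linear correspondence $A$

\[ \mathfrak{y}' = A\mathfrak{y} : \quad y_i' = \sum_k a_{ik} y_k \tag{4.4} \]

of $\mathfrak{R}$ on itself is, as we have seen, associated a bilinear
form

\[ \sum_{i,k} a_{ik} \xi_i y_k \]

which depends linearly on a vector $\xi$ in $P$ and a vector
$\mathfrak{y}$ in $\mathfrak{R}$. In unitary space we can therefore
associate the form

\[ A(\mathfrak{x}, \mathfrak{y}) = \sum_{i,k} a_{ik} \bar{x}_i y_k, \]

depending linearly on $\mathfrak{y} = (y_i)$ and
$\bar{\mathfrak{x}} = (\bar{x}_i)$, with the correspondence (4.4). It is
in fact the scalar product of $\mathfrak{x}$ and $A\mathfrak{y}$. The
special case in which

\[ \tilde{A} = A \quad \text{or} \quad A(\mathfrak{y}, \mathfrak{x}) = \overline{A(\mathfrak{x}, \mathfrak{y})} \quad \text{or} \quad a_{ki} = \bar{a}_{ik} \tag{4.5} \]

bears the name of the French mathematician Hermite. The
correspondence (4.4) is consequently Hermitian if the scalar
product of $\mathfrak{x}$ with $A\mathfrak{y}$ is the conjugate complex
of the scalar product of $\mathfrak{y}$ with $A\mathfrak{x}$. On
identifying $\mathfrak{y}$ with $\mathfrak{x}$ we obtain the
“Hermitian form”

\[ A(\mathfrak{x}) = A(\mathfrak{x}, \mathfrak{x}) = \sum_{i,k} a_{ik} \bar{x}_i x_k, \tag{4.6} \]

i.e. the scalar product of $\mathfrak{x}$ and $A\mathfrak{x}$; in
consequence of (4.5) its value is real. An Hermitian form or
correspondence $A$ is said to be non-degenerate if there exists no
vector $\mathfrak{x}$, except $\mathfrak{x} = 0$, whose transform
$A\mathfrak{x}$ vanishes. It is positive definite if the value
of the form $A(\mathfrak{x}) > 0$ for all vectors $\mathfrak{x} \neq 0$;
a positive definite form is non-degenerate.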
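
The two assertions just made, that the value of a Hermitian form is real and that a positive definite form is non-degenerate, each rest on a one-line argument. The following worked steps are a sketch supplied for the reader's convenience; they use only the definitions of the scalar product, of $A(\mathfrak{x})$, and the condition $a_{ki} = \bar{a}_{ik}$.

```latex
% Reality: conjugate the form, use a_{ki} = \bar{a}_{ik}, then rename i <-> k.
\overline{A(\mathfrak{x})}
  = \sum_{i,k} \bar{a}_{ik}\, x_i \bar{x}_k
  = \sum_{i,k} a_{ki}\, \bar{x}_k x_i
  = A(\mathfrak{x}).

% Non-degeneracy: A(\mathfrak{x}) is the scalar product of \mathfrak{x}
% and A\mathfrak{x}, so a vanishing transform forces a vanishing form.
A\mathfrak{x} = 0
  \;\Longrightarrow\;
  A(\mathfrak{x}) = (\mathfrak{x},\, A\mathfrak{x}) = 0
  \;\Longrightarrow\;
  \mathfrak{x} = 0 \quad \text{(since } A(\mathfrak{x}) > 0 \text{ for } \mathfrak{x} \neq 0\text{)}.
```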

The fundamental metric form (4.2) is one such positive
definite Hermitian form, the “unit form,” the coefficients of
which consist of the numbers

\[ \delta_{ik} = \begin{cases} 1 & (i = k), \\ 0 & (i \neq k). \end{cases} \]

On introducing an arbitrary co-ordinate system $\mathfrak{a}_i$
$(i = 1, 2, \ldots, n)$ into the $n$-dimensional space, the absolute
magnitude of an arbitrary vector

\[ \mathfrak{x} = x_1 \mathfrak{a}_1 + \cdots + x_n \mathfrak{a}_n \]

is given by

\[ \mathfrak{x}^2 = \sum_{i,k} g_{ik} \bar{x}_i x_k, \qquad g_{ik} = (\mathfrak{a}_i \mathfrak{a}_k). \]

The expression for $\mathfrak{x}^2$ is accordingly always a definite
Hermitian form; conversely, any positive definite Hermitian form
$G(\mathfrak{x})$ could be taken as the fundamental metric form. To
show this we employ the associated Hermitian bilinear form
$G(\mathfrak{x}, \mathfrak{y})$ to carry through the following
procedure, which is patterned after the step-by-step construction of a
Cartesian co-ordinate system. Choose any non-vanishing vector
$\mathfrak{e}_1$; since $G(\mathfrak{e}_1) > 0$ we may, on multiplying
$\mathfrak{e}_1$ by an appropriate numerical factor, normalize it in
accordance with the equation $G(\mathfrak{e}_1) = 1$. When the process
of constructing a system of unitary-orthogonal vectors

\[ G(\mathfrak{e}_i, \mathfrak{e}_k) = \delta_{ik} \]

has been carried through $m$ steps, $i, k = 1, 2, \ldots, m$, the next
step is accomplished by choosing a solution
$\mathfrak{x} = \mathfrak{e}_{m+1}$ of the $m < n$ homogeneous linear
equations $G(\mathfrak{e}_i, \mathfrak{x}) = 0$ for the $n$ unknown
components of the vector $\mathfrak{x} \neq 0$ and normalizing it in
accordance with the equation $G(\mathfrak{e}_{m+1}) = 1$. The procedure
comes to an end after $n$ steps; we then have $n$ vectors
$\mathfrak{e}_1, \ldots, \mathfrak{e}_n$ of such a kind that

\[ G(\mathfrak{x}) = \bar{x}_1 x_1 + \bar{x}_2 x_2 + \cdots + \bar{x}_n x_n \]

where

\[ \mathfrak{x} = x_1 \mathfrak{e}_1 + x_2 \mathfrak{e}_2 + \cdots + x_n \mathfrak{e}_n. \]

It follows from the equations themselves that $\mathfrak{x}$ can only
vanish when all of its components $x_i$ vanish, and consequently the
$\mathfrak{e}_i$ are linearly independent and constitute a co-ordinate
system in $\mathfrak{R}$.
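
The step-by-step construction can be carried out numerically. The sketch below is not the procedure exactly as stated: instead of choosing an arbitrary solution of the homogeneous equations $G(\mathfrak{e}_i, \mathfrak{x}) = 0$, it uses the familiar Gram-Schmidt process, which produces one particular solution by subtracting from a trial vector its components along the vectors already found. The form $G$, the trial vectors and the function names are assumptions made for illustration.

```python
import math


def form(G, x, y):
    """Hermitian bilinear form G(x, y) = sum over i, k of g_ik * conj(x_i) * y_k."""
    n = len(G)
    return sum(G[i][k] * complex(x[i]).conjugate() * y[k]
               for i in range(n) for k in range(n))


def orthonormalize(G, vectors):
    """Gram-Schmidt with respect to G: returns e_1, e_2, ... with G(e_i, e_k) = delta_ik."""
    basis = []
    for a in vectors:
        v = [complex(c) for c in a]
        for e in basis:
            coeff = form(G, e, v)                 # component of v along e
            v = [v_i - coeff * e_i for v_i, e_i in zip(v, e)]
        norm_sq = form(G, v, v).real              # real and positive for v != 0
        if norm_sq <= 1e-12:
            raise ValueError("trial vectors are linearly dependent")
        scale = 1 / math.sqrt(norm_sq)
        basis.append([scale * c for c in v])
    return basis


# An assumed positive definite Hermitian form (g_21 is the conjugate of g_12).
G = [[2, 1j],
     [-1j, 2]]
e = orthonormalize(G, [[1, 0], [0, 1]])

for i in range(2):
    print([round(abs(form(G, e[i], e[k])), 12) for k in range(2)])
```

In the co-ordinate system `e` the form $G$ has the coefficients $\delta_{ik}$, so it plays the role of the unit form.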

The transition from affine to metric geometry can accordingly
be accomplished by the introduction of the axiom:

($\delta$) The square of the absolute magnitude of a vector
$\mathfrak{x}$ is a real number $\mathfrak{x}^2$ which is a positive
definite Hermitian form in the components of $\mathfrak{x}$.

These last considerations are useful in another connection.
If $\mathfrak{R}'$ is a linear sub-space of $\mathfrak{R}$ we can employ
the construction used above to find $m$ vectors
$\mathfrak{e}_1, \mathfrak{e}_2, \ldots, \mathfrak{e}_m$ in
$\mathfrak{R}'$ which span $\mathfrak{R}'$ and are mutually
unitary-orthogonal in the sense of the equations
$(\mathfrak{e}_i \mathfrak{e}_k) = \delta_{ik}$. By continuing the
construction we can supplement these $m$ fundamental vectors by
$n - m$ additional ones $\mathfrak{e}_{m+1}, \ldots, \mathfrak{e}_n$, so
that the two sets together form a co-ordinate system for the entire
space $\mathfrak{R}$. We can therefore adapt our normal co-ordinate
system to the separation of $\mathfrak{R}'$ out of $\mathfrak{R}$ or to
the decomposition of $\mathfrak{R} = \mathfrak{R}' + \mathfrak{R}''$
into two perpendicular sub-spaces.

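Since the correspondence $A$ of $\mathfrak{R}$ on to itself is
invariantively connected with the Hermitian form $A$ in
$\mathfrak{R}$, we may speak of the product $BA$ of two Hermitian forms
$A$, $B$ in $\mathfrak{R}$, but this product is not in general
Hermitian as

\[ \widetilde{BA} = \tilde{A}\tilde{B} = AB. \]

The trace of an Hermitian form or correspondence $A$ is real.
The positive definite expression

\[ \operatorname{tr}(A^2) = \sum_{i,k} |a_{ik}|^2 \tag{4.7} \]

is of particular importance. When $\mathfrak{R}$ is decomposed into
mutually perpendicular sub-spaces $\mathfrak{R}_\alpha$
$(\alpha = 1, 2, \ldots)$ the section $A_{\alpha\beta}$ of the
correspondence or form $A$ in which $\mathfrak{R}_\alpha$ intersects
$\mathfrak{R}_\beta$ is uniquely determined; it is a correspondence of
$\mathfrak{R}_\beta$ on $\mathfrak{R}_\alpha$, and $A_{\beta\alpha}$, the
$\beta\alpha$-section of $A$, is a correspondence of
$\mathfrak{R}_\alpha$ on $\mathfrak{R}_\beta$. When the co-ordinate
system is adapted to the decomposition of $\mathfrak{R}$ we have

\[ \operatorname{tr}(A_{\alpha\beta} A_{\beta\alpha}) = \operatorname{tr}(A_{\beta\alpha} A_{\alpha\beta}) = \sum_{i,k} |a_{ik}|^2 \tag{4.8} \]

where in the sum $i$ runs through the $\alpha$-th, $k$ through the
$\beta$-th set of indices.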

Any non-vanishing vector $\mathfrak{a}$ determines a ray which consists
of all vectors of the form $\lambda\mathfrak{a}$, $\lambda$ being an
arbitrary complex number. The generating vector $\mathfrak{a}$ can be
so normalized that its absolute value $|\mathfrak{a}| = 1$; this does
not, however, determine $\mathfrak{a}$ to within a change of sign, as
in the real domain, as the normalization is unaltered on multiplying
$\mathfrak{a}$ by an arbitrary (complex) number $\varepsilon$ of
modulus 1. We shall call the totality of vectors of $\mathfrak{R}$ the
vector field $\mathfrak{R}$ and the totality of rays the ray field of
$\mathfrak{R}$. Any non-degenerate linear correspondence $A$ of the
vector field on itself is at the same time a correspondence of the ray
field on itself, but this latter correspondence is unaltered by
multiplication with any non-vanishing number. A unitary
correspondence or transformation of the ray field on itself will
be briefly referred to as a rotation. By the symbol $S' \simeq S$ we
shall mean that the two transformations $S$, $S'$ of the vector
field on itself differ only by a numerical factor $\varepsilon$ of
modulus 1: $S' = \varepsilon S$, whence they both give rise to the same
rotation of the ray field.

§5. Transformation to Principal Axes 

The fundamental theorem on Hermitian forms is that
concerning the transformation to principal axes. We are here
concerned with the analogue of the familiar problem of finding
the principal axes of an ellipse or ellipsoid in the ordinary
geometry of two or three dimensions. We wish to find a normal
co-ordinate system $\mathfrak{e}_i$ associated with a Hermitian form
$A(\mathfrak{x})$ such that in addition to

\[ \mathfrak{x} = x_1 \mathfrak{e}_1 + x_2 \mathfrak{e}_2 + \cdots + x_n \mathfrak{e}_n, \]

\[ \mathfrak{x}^2 = \bar{x}_1 x_1 + \bar{x}_2 x_2 + \cdots + \bar{x}_n x_n \tag{5.1} \]

we also have

\[ A(\mathfrak{x}) = \alpha_1 \bar{x}_1 x_1 + \alpha_2 \bar{x}_2 x_2 + \cdots + \alpha_n \bar{x}_n x_n; \tag{5.2} \]

that is, $A$ shall be brought into the normal form (5.2) by means
of a unitary transformation. The real numbers
$\alpha_1, \alpha_2, \ldots, \alpha_n$ are called the characteristic
numbers of the form $A$, and
$\mathfrak{e}_1, \mathfrak{e}_2, \ldots, \mathfrak{e}_n$ the
corresponding characteristic vectors.

To this end we first consider the correspondence
$\mathfrak{x} \to \mathfrak{x}' = A\mathfrak{x}$ and seek those vectors
$\mathfrak{x} \neq 0$ which are transformed into multiples
$\mathfrak{x}' = \lambda\mathfrak{x}$ of themselves by $A$. We then
obtain the “secular equation”

\[ f(\lambda) = \det(\lambda 1 - A) = 0 \]

for the multipliers $\lambda$. According to the fundamental theorem of
algebra this equation certainly has a root $\lambda = \alpha_1$;
corresponding to it a non-vanishing vector
$\mathfrak{x} = \mathfrak{e}_1$ can be found which satisfies the
equation $A\mathfrak{e}_1 = \alpha_1 \mathfrak{e}_1$, and on
multiplying this vector by an appropriate numerical factor we may
take it such that its modulus is unity. $\mathfrak{e}_1$ can then be
supplemented by $n - 1$ further vectors
$\mathfrak{e}_2, \ldots, \mathfrak{e}_n$ in such a way that these $n$
vectors constitute a normal co-ordinate system. In these co-ordinates
the formulae

\[ \mathfrak{e}_i' = A\mathfrak{e}_i = \sum_k a_{ki} \mathfrak{e}_k \]

for the correspondence $A$ require, in accordance with the
definition of $\mathfrak{e}_1$, that the coefficients
$a_{21}, a_{31}, \ldots, a_{n1}$ vanish and that $a_{11} = \alpha_1$.
Because of the symmetry conditions $a_{ki} = \bar{a}_{ik}$,
$a_{12}, a_{13}, \ldots, a_{1n}$ must also vanish. Hence in the new
co-ordinates the matrix $A$ assumes the form

\[ \begin{Vmatrix}
\alpha_1 & 0 & 0 & \cdots & 0 \\
0 & a_{22} & a_{23} & \cdots & a_{2n} \\
0 & a_{32} & a_{33} & \cdots & a_{3n} \\
\vdots & \vdots & \vdots & & \vdots \\
0 & a_{n2} & a_{n3} & \cdots & a_{nn}
\end{Vmatrix} \]

and the Hermitian form becomes

\[ A(\mathfrak{x}) = \alpha_1 \bar{x}_1 x_1 + A'(\mathfrak{x}) \tag{5.3} \]

where $A'$ is an Hermitian form containing only the $n - 1$ variables
$x_2, x_3, \ldots, x_n$. Repeating this process, or calling on the
method of mathematical induction, we establish the validity of the
fundamental theorem stated above.
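
The procedure may be followed through by hand in two dimensions. The worked example below is an illustration with an assumed Hermitian matrix; here $\xi_1, \xi_2$ denote the components of $\mathfrak{x}$ in the original normal co-ordinate system and $x_1, x_2$ those in the system of principal axes.

```latex
% Assumed example: a_{11} = a_{22} = 2, a_{12} = i, a_{21} = -i.
A = \begin{Vmatrix} 2 & i \\ -i & 2 \end{Vmatrix}, \qquad
A(\mathfrak{x}) = 2\bar{\xi}_1\xi_1 + 2\bar{\xi}_2\xi_2
                  + i\,\bar{\xi}_1\xi_2 - i\,\bar{\xi}_2\xi_1 .

% Secular equation and its roots.
f(\lambda) = (\lambda - 2)^2 - 1 = 0, \qquad \alpha_1 = 1, \quad \alpha_2 = 3 .

% Normalized characteristic vectors (components in the original system).
\mathfrak{e}_1 = \tfrac{1}{\sqrt{2}}\,(1,\; i), \qquad
\mathfrak{e}_2 = \tfrac{1}{\sqrt{2}}\,(1,\; -i), \qquad
(\mathfrak{e}_1 \mathfrak{e}_2) = \tfrac{1}{2}\bigl(1 + \bar{i}\cdot(-i)\bigr) = 0 .

% New components x_i = (\mathfrak{e}_i \mathfrak{x}), by (4.3).
x_1 = \tfrac{1}{\sqrt{2}}\,(\xi_1 - i\,\xi_2), \qquad
x_2 = \tfrac{1}{\sqrt{2}}\,(\xi_1 + i\,\xi_2) .

% Normal form (5.2): expanding the right-hand side reproduces A(\mathfrak{x}).
A(\mathfrak{x}) = 1\cdot\bar{x}_1 x_1 + 3\cdot\bar{x}_2 x_2 .
```

The sum of the characteristic numbers, $1 + 3$, equals the trace $2 + 2$ of the matrix, and their product equals its determinant $4 - 1 = 3$.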

The characteristic polynomial of (5.2) is

\[ \det(\lambda 1 - A) = (\lambda - \alpha_1)(\lambda - \alpha_2) \cdots (\lambda - \alpha_n). \]

From this it follows that the characteristic numbers
$\alpha_1, \alpha_2, \ldots, \alpha_n$, including their multiplicity,
are uniquely determined by the Hermitian form $A$; their sum is the
trace of $A$. What can we say concerning the characteristic vectors?
Let $\alpha$ be a given real number; the vectors $\mathfrak{x}$ which
satisfy the equation $A\mathfrak{x} = \alpha\mathfrak{x}$ constitute a
linear sub-space $\mathfrak{R}(\alpha)$ of $\mathfrak{R}$, the
characteristic space belonging to $\alpha$. When the normal
co-ordinate system $\mathfrak{e}_i$ is so chosen that $A$ is in the
normal form, the equation $A\mathfrak{x} = \alpha\mathfrak{x}$ is, in
terms of its components,

\[ \alpha_i x_i = \alpha x_i, \]

from which it follows that $\mathfrak{R}(\alpha)$ is spanned by those
vectors $\mathfrak{e}_i$ for which $\alpha_i = \alpha$. If, for example,
the three roots $\alpha_1, \alpha_2, \alpha_3 = \alpha$ while all the
others are different from $\alpha$, the characteristic space
$\mathfrak{R}(\alpha)$ is 3-dimensional. If none of the characteristic
numbers $\alpha_i$ is equal to $\alpha$, $\mathfrak{R}(\alpha)$ consists
only of the vector 0. This again characterizes the characteristic
numbers, including their multiplicity, in a way which is independent
of the particular co-ordinate system chosen, and in addition it
characterizes the corresponding sub-spaces $\mathfrak{R}(\alpha)$.
$\mathfrak{R}$ is thus decomposed into the characteristic spaces
$\mathfrak{R}(\alpha)$:
$\mathfrak{R} = \sum_\alpha \mathfrak{R}(\alpha)$; only a finite number
of terms occurs in this sum, i.e. those for which $\alpha$ is a
characteristic number of $A$. A complete co-ordinate system
$\mathfrak{e}_1, \mathfrak{e}_2, \ldots, \mathfrak{e}_n$ for the entire
space $\mathfrak{R}$ can be obtained by choosing a normal co-ordinate
system in each non-null sub-space $\mathfrak{R}(\alpha)$. The normal
form (5.2) is undisturbed on subjecting the variables $x_i$ associated
with the same characteristic number $\alpha_i = \alpha$ to an
arbitrary unitary transformation.

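If, for example, $\alpha$ is a triple characteristic number

\[ \alpha_1 = \alpha_2 = \alpha_3 = \alpha \]

while the remaining $\alpha_i \neq \alpha$, then
$x_1 \mathfrak{e}_1 + x_2 \mathfrak{e}_2 + x_3 \mathfrak{e}_3 = \mathfrak{x}_\alpha$
is the normal projection of the vector $\mathfrak{x}$ on
$\mathfrak{R}(\alpha)$ and

\[ E_\alpha(\mathfrak{x}) = \bar{x}_1 x_1 + \bar{x}_2 x_2 + \bar{x}_3 x_3 \]

is the scalar product of $\mathfrak{x}_\alpha$ with itself. The
equations (5.1), (5.2) may then be written in the invariant form

\[ \mathfrak{x}^2 = \sum_\alpha E_\alpha(\mathfrak{x}), \qquad A(\mathfrak{x}) = \sum_\alpha \alpha \cdot E_\alpha(\mathfrak{x}). \tag{5.4} \]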

$\mathfrak{R}'$ being a sub-space of $\mathfrak{R}$, any vector
$\mathfrak{x}$ can be uniquely broken up into $\mathfrak{x}' + \mathfrak{x}_0$
where $\mathfrak{x}'$ lies in $\mathfrak{R}'$ and $\mathfrak{x}_0$ is
perpendicular to $\mathfrak{R}'$. The "orthogonal projection"
$\mathfrak{x} \to \mathfrak{x}' = E'\mathfrak{x}$ is a linear correspondence
which obviously has the property

$$E'E' = E', \tag{5.5}$$

for the projection of $\mathfrak{x}'$ on $\mathfrak{R}'$ is simply
$\mathfrak{x}'$ itself. Furthermore, the operator $E'$ is Hermitian, for the
scalar product of $\mathfrak{y}$ into $\mathfrak{x}'$ is equal to the scalar
product of $\mathfrak{y}'$ into $\mathfrak{x}'$, where $\mathfrak{y}'$ is the
projection of $\mathfrak{y}$ on $\mathfrak{R}'$. (The Hermitian form
$E'(\mathfrak{x})$ is accordingly the square of the absolute value of
$\mathfrak{x}'$.) We shall call Hermitian forms which satisfy equation (5.5)
idempotent.

When the sub-spaces $\mathfrak{R}'$, $\mathfrak{R}''$ are orthogonal, the two
corresponding projection operators $E'$, $E''$ satisfy the equations

$$E'E'' = 0, \qquad E''E' = 0, \tag{5.6}$$

for $E'(E''\mathfrak{x})$ is the component of $E''\mathfrak{x}$ lying in the
space $\mathfrak{R}'$ perpendicular to $E''\mathfrak{x}$. Idempotent operators
which satisfy these equations are said to be independent. The second equation
is, moreover, a consequence of the first, as may be seen on going over to the
Hermitian conjugate: $E''E' = 0$. If $\mathfrak{R}$ is decomposed into several
mutually orthogonal sub-spaces
$\mathfrak{R}' + \mathfrak{R}'' + \cdots$, then

$$\mathfrak{x} = E'\mathfrak{x} + E''\mathfrak{x} + \cdots. \tag{5.7}$$

It is easily shown that the converses of all these assertions are also valid.
If $E'$ is an idempotent operator and $E'' = 1 - E'$, all vectors of the form
$E'\mathfrak{x}$ constitute a linear sub-space $\mathfrak{R}'$ and all vectors
of the form $E''\mathfrak{x}$ a sub-space $\mathfrak{R}''$. The equation

$$E'E'' = E'(1 - E') = E' - E'E' = 0$$

shows that the scalar product of a vector $E'\mathfrak{x}$ in $\mathfrak{R}'$
and a vector $E''\mathfrak{y}$ in $\mathfrak{R}''$ is zero:
$(E'\mathfrak{x}, E''\mathfrak{y}) = (\mathfrak{x}, E'E''\mathfrak{y}) = 0$.
The decomposition of a vector $\mathfrak{x}$ into a component lying in
$\mathfrak{R}'$ and one perpendicular to $\mathfrak{R}'$ is accordingly
expressed by

$$\mathfrak{x} = E'\mathfrak{x} + (1 - E')\mathfrak{x}.$$

If the two idempotent forms $E'$, $E''$ satisfy the equation (5.6) then, as we
have just seen, the two corresponding characteristic spaces $\mathfrak{R}'$,
$\mathfrak{R}''$ are mutually perpendicular. If the sum (5.7) consists of
independent idempotent forms, then by the above the corresponding mutually
perpendicular sub-spaces $\mathfrak{R}'$, $\mathfrak{R}''$, $\cdots$ exhaust
the entire space $\mathfrak{R}$.

The theorem on transformation to principal axes can accordingly be stated:
An Hermitian form $A$ associates with the real numbers $\alpha$ mutually
independent idempotent Hermitian forms $E_\alpha$ such that

$$1 = \sum_\alpha E_\alpha, \qquad A = \sum_\alpha \alpha \cdot E_\alpha; \tag{5.8}$$

$E_\alpha$ is non-vanishing for only a finite number of values $\alpha$.

A correspondence $A$ can be reiterated:

$$AA = A^2, \quad A^2A = A^3, \cdots$$

and we can accordingly obtain polynomials

$$f(A) = c_0 1 + c_1 A + c_2 A^2 + \cdots + c_h A^h$$

in $A$ with numerical coefficients $c$. On reiterating (5.8) $h - 1$ times
(the forms $E_\alpha$ being idempotent and mutually independent)

$$A^h = \sum_\alpha \alpha^h E_\alpha,$$

whence for the general polynomial $f$

$$f(A) = \sum_\alpha f(\alpha) E_\alpha. \tag{5.9}$$

The characteristic numbers of $f(A)$ are therefore the values of the
polynomial $f(\alpha)$ for the characteristic numbers $\alpha$ of $A$. This
suggests defining the Hermitian form $f(A)$, where $f(\alpha)$ is any real
function of the real variable $\alpha$, by means of the equation (5.9).
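The decomposition of a Hermitian matrix into independent idempotent forms, and the resulting rule for functions of the matrix, can be checked on a small example. The following is a minimal sketch in Python, not part of the original text: the matrix `A`, its projections `E3` and `E1` (belonging to the characteristic numbers 3 and 1), and all helper names are illustrative assumptions.

```python
import math

def matmul(P, Q):
    """Product of two square matrices given as nested lists."""
    n = len(P)
    return [[sum(P[i][r] * Q[r][k] for r in range(n)) for k in range(n)]
            for i in range(n)]

def combine(terms):
    """Return the sum of coeff * matrix over (coeff, matrix) pairs."""
    n = len(terms[0][1])
    return [[sum(c * M[i][k] for c, M in terms) for k in range(n)]
            for i in range(n)]

def close(P, Q, tol=1e-12):
    """True when two matrices agree entry by entry within tol."""
    n = len(P)
    return all(abs(P[i][k] - Q[i][k]) < tol
               for i in range(n) for k in range(n))

# Assumed example: a real symmetric (hence Hermitian) matrix with
# characteristic numbers 3 and 1 and the corresponding projections.
A = [[2.0, 1.0], [1.0, 2.0]]
E3 = [[0.5, 0.5], [0.5, 0.5]]      # projection on the direction (1, 1)
E1 = [[0.5, -0.5], [-0.5, 0.5]]    # projection on the direction (1, -1)
ONE = [[1.0, 0.0], [0.0, 1.0]]
ZERO = [[0.0, 0.0], [0.0, 0.0]]

def f_of_A(f):
    """f(A) = f(3) * E3 + f(1) * E1, the spectral rule for this A."""
    return combine([(f(3.0), E3), (f(1.0), E1)])

exp_A = f_of_A(math.exp)           # e^A defined through its spectrum
```

With these definitions `matmul(E3, E3)` reproduces `E3`, `matmul(E3, E1)` vanishes, and `f_of_A(lambda a: a * a)` agrees with `matmul(A, A)`, which is the polynomial case of the rule.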

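Given two Hermitian forms $A$, $B$, under what conditions can they be brought
simultaneously into diagonal form, i.e. when is it possible to find a normal
co-ordinate system in which

$$A(\mathfrak{x}) = \alpha_1 \bar{x}_1 x_1 + \cdots + \alpha_n \bar{x}_n x_n, \qquad B(\mathfrak{x}) = \beta_1 \bar{x}_1 x_1 + \cdots + \beta_n \bar{x}_n x_n\,? \tag{5.10}$$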




A necessary condition is that they commute: $BA = AB$, for if $A$ and $B$ are
in the normal form (5.10) $BA$ as well as $AB$ is the diagonal matrix with
elements $\alpha_i \beta_i$. This condition is also sufficient; to prove this,
choose a normal co-ordinate system in which $A$ is already in normal form. The
equation $BA = AB$ requires that the matrix $B = \|b_{ik}\|$ satisfy

$$b_{ik}\alpha_k = \alpha_i b_{ik} \quad\text{or}\quad (\alpha_i - \alpha_k) b_{ik} = 0. \tag{5.11}$$

We divide the indices $i$, the fundamental vectors $\mathfrak{e}_i$ and the
variables $x_i$ into classes by considering $i$ and $k$ to be of the same
class if $\alpha_i = \alpha_k$. Equation (5.11) states that $b_{ik} = 0$ when
$i$ and $k$ belong to different classes. $B$ is consequently decomposed into
smaller matrices $B'$, $B''$, $\cdots$ aligned along the principal diagonal,
corresponding to the way in which the $\alpha_i$ are distributed in classes
$\alpha'$, $\alpha''$, $\cdots$; the correspondence $B$ consequently leaves
each of the characteristic spaces $\mathfrak{R}(\alpha')$,
$\mathfrak{R}(\alpha'')$, $\cdots$ of $A$ invariant. But we can then choose a
normal co-ordinate system in each of these characteristic sub-spaces
$\mathfrak{R}(\alpha)$ in such a way that the Hermitian correspondences $B'$,
$B''$, $\cdots$ in them are referred to principal axes; the normal form of $A$
is undisturbed by this procedure.
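The step from commutation to block structure can be made concrete. The sketch below is an illustration added here under stated assumptions: the diagonal entries `alphas` and the matrices `B_good` and `B_bad` are made-up examples, and the helper names are hypothetical. It checks that a matrix commuting with a diagonal matrix has no entries linking indices of different classes.

```python
def matmul(P, Q):
    """Product of two square matrices given as nested lists."""
    n = len(P)
    return [[sum(P[i][r] * Q[r][k] for r in range(n)) for k in range(n)]
            for i in range(n)]

def commutes(A, B, tol=1e-12):
    """True when BA = AB entry by entry."""
    BA, AB = matmul(B, A), matmul(A, B)
    n = len(A)
    return all(abs(BA[i][k] - AB[i][k]) < tol
               for i in range(n) for k in range(n))

def offending_entries(alphas, B, tol=1e-12):
    """Index pairs (i, k) of different classes with b_ik != 0."""
    n = len(alphas)
    return [(i, k) for i in range(n) for k in range(n)
            if alphas[i] != alphas[k] and abs(B[i][k]) > tol]

# Assumed example: A in normal form with a double characteristic number.
alphas = [1.0, 1.0, 4.0]
A = [[alphas[i] if i == k else 0.0 for k in range(3)] for i in range(3)]

# Hermitian B built from a 2x2 block and a 1x1 block: commutes with A.
B_good = [[2.0, 1j, 0.0], [-1j, 2.0, 0.0], [0.0, 0.0, 5.0]]

# Hermitian B with an entry joining the two classes: does not commute.
B_bad = [[2.0, 1j, 3.0], [-1j, 2.0, 0.0], [3.0, 0.0, 5.0]]
```

Here `commutes(A, B_good)` holds and `offending_entries(alphas, B_good)` is empty, whereas `B_bad` fails both checks; the 2x2 block of `B_good` can then be diagonalized on its own without disturbing `A`.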

This process can immediately be applied to any number of Hermitian forms:
Any number of Hermitian forms can be brought simultaneously into normal form
if and only if they commute with one another. By a slight modification we can
further extend this theorem to an arbitrary finite or infinite system
$\Sigma$ of Hermitian forms. This will be briefly discussed here, although in
general the consideration of systems of forms or correspondences is postponed
until Chap. III. Let the space $\mathfrak{R}$ be decomposed into mutually
perpendicular sub-spaces $\mathfrak{R}'$, $\mathfrak{R}''$, $\cdots$ in such a
way that each correspondence of the system $\Sigma$ carries each of these
sub-spaces into itself; on adapting the co-ordinate system to this
decomposition each Hermitian matrix $A$ of $\Sigma$ consists of sub-matrices
$A'$, $A''$, $\cdots$ aligned along the principal diagonal. If all the $A'$
are already multiples of the unit matrix 1 in $\mathfrak{R}'$ and similarly
for all $A''$, $\cdots$, our goal is reached, for each correspondence $A$ of
the system then transforms $\mathfrak{R}'$ into itself and is a simple
multiplication in it; similarly for $\mathfrak{R}''$, $\cdots$. But if this is
not the case let $A$ be a correspondence of the system which is not merely a
multiplication in the sub-space $\mathfrak{R}'$. On transforming the
constituent $A'$ of $A$ to principal axes, $\mathfrak{R}'$ is decomposed into
characteristic spaces $\mathfrak{R}_1' + \mathfrak{R}_2' + \cdots$ of $A'$, of
which there are at least two. For any Hermitian matrix $X$ of $\Sigma$ we have
$A'X' = X'A'$, from which it follows, as we saw above, that $X'$ transforms
each of the sub-spaces $\mathfrak{R}_1'$, $\mathfrak{R}_2'$, $\cdots$ into
itself. The decomposition $\mathfrak{R}' + \mathfrak{R}'' + \cdots$ can thus
be further reduced to the decomposition
$(\mathfrak{R}_1' + \mathfrak{R}_2' + \cdots) + \mathfrak{R}'' + \cdots$.
Proceeding in this way we finally reach our goal after at most $n$ steps,
proving:

The Hermitian forms of any system $\Sigma$ can be simultaneously referred to
principal axes if they all commute with one another.

The theory developed above for Hermitian correspondences is valid as it stands
for unitary transformations. $S$ being any unitary operator, a normal
co-ordinate system $\mathfrak{e}_i$ can be introduced in such a way that $S$
carries each of the fundamental vectors $\mathfrak{e}_i$ over into a multiple
$\sigma_i \mathfrak{e}_i$ of itself. The characteristic numbers $\sigma_i$ of
$S$ are numbers of modulus 1. In these co-ordinates the matrix of $S$ is a
diagonal matrix, the elements in the principal diagonal of which are the
numbers $\sigma_i$.

The proof is quite analogous. We again start with the secular equation

$$\det(\sigma 1 - S) = 0$$

and consider the root $\sigma_1$. There then exists a vector $\mathfrak{e}_1$
of modulus 1 which is transformed into $\sigma_1 \mathfrak{e}_1$ by the
correspondence $S$. Supplement $\mathfrak{e}_1$ with $n - 1$ further vectors
$\mathfrak{e}_2, \cdots, \mathfrak{e}_n$ so that these $n$ vectors form a
normal co-ordinate system. In these co-ordinates the matrix $\|s_{ik}\|$ of
the correspondence $S$:

$$S\mathfrak{e}_i = \sum_k s_{ki} \mathfrak{e}_k$$

is again of the form

$$s_{11} = \sigma_1, \quad s_{21} = \cdots = s_{n1} = 0.$$

Since $S$ is unitary the sum of the squares of the moduli of these elements of
the first column must be unity, whence $|\sigma_1| = 1$. Similarly the sum of
the squares of the moduli of the elements in the first row must also be 1:

$$|\sigma_1|^2 + |s_{12}|^2 + \cdots + |s_{1n}|^2 = 1;$$

but since $|\sigma_1|^2 = 1$ it follows that

$$s_{12} = \cdots = s_{1n} = 0.$$

The matrix $S$ is now broken up into a 1-dimensional $\sigma_1$ and an
$(n - 1)$-dimensional $S'$ as in (5.3); the truth of the above theorem then
follows immediately by induction.
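That the characteristic numbers of a unitary matrix have modulus 1 can be illustrated in two dimensions, where the secular equation is a quadratic. The following sketch is an added illustration, not the author's procedure: the matrix `S` is an assumed example and the helper names are hypothetical.

```python
import cmath
import math

def eigenvalues_2x2(S):
    """Roots of det(sigma*1 - S) = 0 for a 2x2 matrix S."""
    tr = S[0][0] + S[1][1]
    det = S[0][0] * S[1][1] - S[0][1] * S[1][0]
    root = cmath.sqrt(tr * tr - 4 * det)
    return (tr + root) / 2, (tr - root) / 2

def is_unitary(S, tol=1e-12):
    """Check that the columns of S are unitary-orthogonal."""
    n = len(S)
    for i in range(n):
        for k in range(n):
            dot = sum(complex(S[r][i]).conjugate() * S[r][k]
                      for r in range(n))
            if abs(dot - (1.0 if i == k else 0.0)) > tol:
                return False
    return True

# Assumed example of a unitary matrix that is not diagonal.
r = 1 / math.sqrt(2)
S = [[r, 1j * r], [1j * r, r]]

sigma1, sigma2 = eigenvalues_2x2(S)
moduli = (abs(sigma1), abs(sigma2))   # both should equal 1
```

For this `S` the two roots are complex conjugates of each other on the unit circle, in agreement with the statement that a unitary correspondence acts on each characteristic vector as multiplication by a number of modulus 1.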

The further results can be obtained in exactly the same way as above for
Hermitian forms. The characteristic numbers $\sigma_i$, including their
multiplicity but not their order, are uniquely determined by $S$, and
similarly for the corresponding sub-spaces. If we wish to find a linearly
independent system of characteristic vectors, the fundamental vectors of each
such sub-space may be taken as forming a normal co-ordinate system. Finally,
a finite or infinite set of unitary transformations can be simultaneously
reduced to normal form if and only if they commute among themselves.


§ 6. Infinitesimal Unitary Transformations 

A rigid body in continuous motion about a fixed point $O$ performs an
infinitesimal rotation in each interval $d\tau$ of time. Denoting by
$(dx_1, dx_2, dx_3)$ the infinitesimal displacement of that point of the rigid
body which is at the point $P(x_1, x_2, x_3)$ at the time $\tau$, the
equations of motion of the body must be of the form

$$\frac{dx_i}{d\tau} = \sum_k c_{ik} x_k \tag{6.1}$$

in which the coefficients $c_{ik}$ are constants, i.e. independent of the
particular point $P$ under consideration. Employing a Cartesian co-ordinate
system with $O$ as origin, $x_1^2 + x_2^2 + x_3^2$ must remain unchanged
throughout the motion; this requires that

$$\sum_i x_i \frac{dx_i}{d\tau} = 0 \quad\text{or}\quad \sum_{i,k} c_{ik} x_i x_k = 0.$$

Since this equation must be satisfied identically in the $x_i$, the matrix
$C = \|c_{ik}\|$ which characterizes the motion must be anti-symmetric:
$c_{ki} = -c_{ik}$. Introducing the vector $\mathfrak{r}$ with origin at $O$
and terminus at the point $P$, and the vector
$\mathfrak{c} = (c_{23}, c_{31}, c_{12})$, equations (6.1) become



the familiar fundamental formulae for the kinematics of a rigid body. The
square brackets denote the vector product and $\mathfrak{c}$ the vectorial
angular velocity, the absolute value and direction of which give the angular
velocity and direction of the axis of rotation respectively.

The continuous compounding of interest offers another example of an
infinitesimal linear transformation. The interest rate $c$ being a real
number, the increase in the capital $x$ in time $d\tau$ is $xc\,d\tau$.
Radioactive disintegration is the same kind of a process with negative $c$.
The capital $x$, considered as a function of the time, satisfies the equation

$$\frac{dx}{d\tau} = cx \tag{6.2}$$

and consequently increases exponentially with $\tau$. If the principal has the
value $x_0$ at time $\tau = 0$, it will have increased to

$$x(\tau) = x_0 e^{c\tau}$$

at time $\tau$. To obtain an alternative solution we divide, as in the method
of finite differences, the time interval $\tau$ into a large number $n$ of
equal elements $\tau/n$; $x$ will increase by $xc\tau/n$ in each of these
intervals and the capital $x$ will accordingly be multiplied by
$(1 + c\tau/n)^n$ at the end of time $\tau$. The familiar definition

$$e^{c\tau} = \lim_{n \to \infty} \left(1 + \frac{c\tau}{n}\right)^n \tag{6.3}$$

of the exponential function follows from a comparison of these two results.
But we can also solve the differential equation (6.2) by the method of
successive approximations. We take as the 0th approximation the initial value
$x_0$: $x_0(\tau) = x_0$. The $(n + 1)$st approximation is obtained from the
$n$th by substituting the latter in place of $x$ on the right-hand side of
(6.2) and integrating:

$$x_{n+1}(\tau) = x_0 + c \int_0^\tau x_n(t)\,dt.$$

On carrying out this process we find

$$x_n(\tau) = x_0 \left(1 + \frac{c\tau}{1!} + \cdots + \frac{(c\tau)^n}{n!}\right),$$

from which we obtain the familiar power series expansion

$$e^{c\tau} = 1 + \frac{c\tau}{1!} + \frac{(c\tau)^2}{2!} + \cdots \tag{6.4}$$

for the exponential function. The convergence of (6.3) and (6.4) and the
identity of their limits is rigorously proved by elementary analysis.
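The two constructions of the exponential function can be compared numerically. The following is a small sketch added for illustration; the function names `compound` and `successive_approximation` and the sample values of the rate and time are assumptions, not taken from the text.

```python
import math

def compound(c, tau, n):
    """Growth factor after n equal interest periods of length tau/n."""
    return (1.0 + c * tau / n) ** n

def successive_approximation(c, tau, n):
    """n-th successive approximation x_n(tau) / x_0, i.e. the partial
    sum 1 + c*tau/1! + ... + (c*tau)^n / n! of the power series."""
    term = 1.0
    total = 1.0
    for k in range(1, n + 1):
        term *= c * tau / k
        total += term
    return total

c, tau = 0.05, 10.0                      # assumed sample values
exact = math.exp(c * tau)
by_compounding = compound(c, tau, 10**6)
by_series = successive_approximation(c, tau, 20)
```

Both `by_compounding` and `by_series` approach `exact`; the series converges far more rapidly than the compounding limit, whose error decreases only in proportion to 1/n. A negative `c` gives the corresponding picture for radioactive decay.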

These examples will assist in understanding the concept of an infinitesimal
unitary transformation of the $n$-dimensional space
$\mathfrak{R} = \mathfrak{R}_n$, which we now proceed to introduce. In order
to avoid the use of infinitesimals we introduce a (purely fictitious) time
$\tau$ and think of the infinitesimal linear correspondence which carries the
vector $\mathfrak{x}$ over into $\mathfrak{x} + d\mathfrak{x}$ as taking place
in the time interval $d\tau$:

$$\frac{d\mathfrak{x}}{d\tau} = C\mathfrak{x}, \qquad \frac{dx_i}{d\tau} = \sum_k c_{ik} x_k.$$

(For the sake of brevity we refer to this simply as "the infinitesimal
transformation $C$.") Since the transformation is unitary, on employing a
normal co-ordinate system $\sum_i \bar{x}_i x_i$ must remain unchanged:

$$\sum_i \bar{x}_i \frac{dx_i}{d\tau} + \sum_k \frac{d\bar{x}_k}{d\tau} x_k = 0. \tag{6.5}$$


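On setting

$$\frac{dx_i}{d\tau} = \sum_k c_{ik} x_k, \qquad \frac{d\bar{x}_k}{d\tau} = \sum_i \bar{c}_{ki} \bar{x}_i$$

the left-hand side of (6.5) reduces to the Hermitian form

$$\sum_{i,k} (c_{ik} + \bar{c}_{ki}) \bar{x}_i x_k$$

and since it must vanish identically in the $x$, we must have
$c_{ik} + \bar{c}_{ki} = 0$, or the transformation $C$ is anti-symmetric in
the sense of the equation

$$c_{ik} = -\bar{c}_{ki}, \qquad \tilde{C} = -C, \tag{6.6}$$

where $\tilde{C}$ denotes the Hermitian conjugate of $C$.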


In the real domain there exists no intimate relationship between symmetric and
anti-symmetric matrices, but the situation is different in the complex domain.
For on setting $C = iH$ ($i$ being the imaginary unit $\sqrt{-1}$) it follows
from (6.6) that $H$ satisfies the equation $\tilde{H} = H$, and $C$ is
consequently $i$ times an Hermitian matrix. In an infinitesimal unitary
rotation of a vector field the velocity $d\mathfrak{x}/d\tau$ is related to
$\mathfrak{x}$ by means of a correspondence whose matrix is $i$ times an
Hermitian matrix. The theorem on transformation of Hermitian forms to
principal axes is accordingly the limiting case of an analogous theorem on
unitary transformations.

By repeated application of the infinitesimal unitary transformation

$$d\mathfrak{x} = d\tau \cdot C\mathfrak{x} \tag{6.7}$$

we obtain after time $\tau$

$$\mathfrak{x}(\tau) = U(\tau)\mathfrak{x} = e^{\tau C}\mathfrak{x} \tag{6.8}$$

where the exponential function $e^A$ for a matrix $A$ can be defined by either

$$\lim_{n \to \infty} \left(1 + \frac{A}{n}\right)^n$$

or the power series

$$1 + \frac{A}{1!} + \frac{A^2}{2!} + \cdots.$$

Naturally

$$U(\tau + \tau') = U(\tau)\,U(\tau').$$

Accordingly $U(\tau)$ runs through all the transformations of a 1-parameter
continuous group of unitary transformations generated by the infinitesimal
transformation $C$; the parameter $\tau$ is additive on composition. The power
series is obtained by the method of successive approximations; this method can
also be applied to obtain a solution in the more general case in which the
infinitesimal unitary transformation $C$ is not the same for each time element
$d\tau$, i.e. in which $C$ is a matrix $C(\tau)$ depending on the time $\tau$.
The solution of the equation

$$\frac{d\mathfrak{x}}{d\tau} = C(\tau)\mathfrak{x}$$

for this case is given by

$$\mathfrak{x}(\tau_2) = U(\tau_2 \tau_1)\,\mathfrak{x}(\tau_1);$$

the unitary transformation $U(\tau_2 \tau_1)$ which takes place in the time
interval $\tau_1 \tau_2$ obeys the law of composition

$$U(\tau_3 \tau_1) = U(\tau_3 \tau_2)\,U(\tau_2 \tau_1).$$
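For a constant generator the statements above can be verified numerically: the exponential of i times a Hermitian matrix is unitary, and the parameter adds on composition. The sketch below is an added illustration under stated assumptions; the Hermitian matrix `H`, the truncation of the power series at 60 terms, and all helper names are choices made here, not prescriptions from the text.

```python
def identity(n):
    return [[1.0 + 0j if i == k else 0j for k in range(n)] for i in range(n)]

def matmul(P, Q):
    n = len(P)
    return [[sum(P[i][r] * Q[r][k] for r in range(n)) for k in range(n)]
            for i in range(n)]

def scale(P, c):
    return [[c * p for p in row] for row in P]

def add(P, Q):
    n = len(P)
    return [[P[i][k] + Q[i][k] for k in range(n)] for i in range(n)]

def dagger(P):
    """Hermitian conjugate: transpose and complex-conjugate."""
    n = len(P)
    return [[P[k][i].conjugate() for k in range(n)] for i in range(n)]

def expm(A, terms=60):
    """Partial sum 1 + A/1! + A^2/2! + ... of the exponential series."""
    result = identity(len(A))
    term = identity(len(A))
    for k in range(1, terms):
        term = scale(matmul(term, A), 1.0 / k)
        result = add(result, term)
    return result

def close(P, Q, tol=1e-10):
    n = len(P)
    return all(abs(P[i][k] - Q[i][k]) < tol
               for i in range(n) for k in range(n))

# Assumed example: a Hermitian H and the generator C = iH.
H = [[1.0 + 0j, 1 - 1j], [1 + 1j, -2.0 + 0j]]
C = scale(H, 1j)

def U(tau):
    """U(tau) = exp(tau * C), computed from the power series."""
    return expm(scale(C, tau))
```

With these definitions `matmul(dagger(U(0.3)), U(0.3))` is the unit matrix to within rounding, and `matmul(U(0.3), U(0.5))` agrees with `U(0.8)`. A fixed number of series terms is adequate only for moderate values of the parameter.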

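If $\mathfrak{x} = \mathfrak{x}_0$ at time $\tau = 0$, the formulas for the
successive approximations $\mathfrak{x}_l(\tau)$ are

$$\mathfrak{x}_0(\tau) = \mathfrak{x}_0; \qquad \mathfrak{x}_{l+1}(\tau) = \mathfrak{x}_0 + \int_0^\tau C(t)\,\mathfrak{x}_l(t)\,dt;$$

for $U(\tau) = U(\tau\,0)$ we obtain the infinite series
$\sum_{l=0}^{\infty} U_l(\tau)$ in which

$$U_0(\tau) = 1; \qquad U_{l+1}(\tau) = \int_0^\tau C(t)\,U_l(t)\,dt. \tag{6.9}$$

Written explicitly,

$$U_l(\tau) = \int\!\!\int \cdots \int C(t_1)\,C(t_2) \cdots C(t_l)\,dt_1\,dt_2 \cdots dt_l \qquad (0 \le t_l \le \cdots \le t_2 \le t_1 \le \tau).$$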

The proof of the convergence of this process is readily obtained with the aid
of the quantity $|A|$ associated with a matrix $A = \|a_{ik}\|$ by the
equation

$$|A|^2 = \operatorname{tr}(\tilde{A}A) = \sum_{i,k} |a_{ik}|^2,$$

$\tilde{A}$ being the Hermitian conjugate of $A$. It follows from the
well-known Schwarz inequality

$$|a_1 b_1 + a_2 b_2 + \cdots + a_n b_n|^2 \le (|a_1|^2 + \cdots + |a_n|^2)(|b_1|^2 + \cdots + |b_n|^2) \tag{6.10}$$

that

$$|A + B| \le |A| + |B|$$

and that

$$|AB| \le |A|\,|B|.$$

The second inequality is obtained by applying (6.10) to the element

$$c_{ik} = \sum_r a_{ir} b_{rk}$$

of $C = AB$:

$$|c_{ik}|^2 \le \sum_r |a_{ir}|^2 \cdot \sum_r |b_{rk}|^2$$

and summing with respect to $i$ and $k$. The first inequality may be stated in
the form

$$\left|\int_0^\tau A(t)\,dt\right| \le \int_0^\tau |A(t)|\,dt$$

for integrals. The convergence of $\sum_l U_l(\tau)$ can now be established
with the aid of these auxiliary results, for we can prove under the assumption

$$|C(t)| \le c \qquad (0 \le t \le \tau)$$

that

$$|U_l(\tau)| \le \frac{(c\tau)^l}{l!}.$$

For this is certainly true for $l = 0$, and the recursion formula (6.9)
enables us to conclude that it holds for $U_{l+1}$ if it holds for $U_l$. The
convergence follows from this absolute convergence, for the absolute value of
each component of the matrix $A$ is certainly not greater than $|A|$.
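The two inequalities for the matrix norm, and the resulting bound on the terms of the series for a constant generator, can be spot-checked. This is a minimal sketch added for illustration: the matrices `A`, `B`, `C` and the value of `tau` are assumed examples, and the check of the bound starts at the first-order term.

```python
import math

def norm(A):
    """|A| with |A|^2 equal to the sum of |a_ik|^2 over all entries."""
    return math.sqrt(sum(abs(a) ** 2 for row in A for a in row))

def matmul(P, Q):
    n = len(P)
    return [[sum(P[i][r] * Q[r][k] for r in range(n)) for k in range(n)]
            for i in range(n)]

def add(P, Q):
    n = len(P)
    return [[P[i][k] + Q[i][k] for k in range(n)] for i in range(n)]

# Assumed sample matrices with complex entries.
A = [[1.0, 2j], [-1j, 0.5]]
B = [[0.0, 1.0], [1 + 1j, -2.0]]

triangle_ok = norm(add(A, B)) <= norm(A) + norm(B)
product_ok = norm(matmul(A, B)) <= norm(A) * norm(B)

# Constant generator C (i times a Hermitian matrix) and the terms
# U_l = (tau*C)^l / l! of the exponential series, for l = 1, ..., 6.
C = [[0.0, 1.0], [-1.0, 0.0]]
tau = 0.7
c = norm(C)
bounds_ok = True
term = [[1.0, 0.0], [0.0, 1.0]]
for l in range(1, 7):
    term = [[tau * x / l for x in row] for row in matmul(term, C)]
    if norm(term) > (c * tau) ** l / math.factorial(l) + 1e-12:
        bounds_ok = False
```

All three flags come out true for these inputs. The product inequality is what lets the bound on one term of the series be passed to the next.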

We have only gone into these matters to reassure the reader 
of the legitimacy of dealing with infinitesimal quantities of the 
kind met here. The only thing of importance for the following 
is the simple relation existing between infinitesimal unitary 
transformations and Hermitian forms. 

§ 7. Remarks on ∞-dimensional Space

The unitary spaces which appear in quantum mechanics usually have an infinite
number of dimensions. Such a space consists of all vectors

$$\mathfrak{x} = (x_1, x_2, \cdots)$$

whose components $x_i$ constitute an infinite sequence of numbers for which

$$\mathfrak{x}^2 = \bar{x}_1 x_1 + \bar{x}_2 x_2 + \cdots$$

converges. Within this domain addition and multiplication with numbers, as
well as the construction of the scalar product of two vectors, are possible.
All the axioms employed so far are satisfied, with the exception of the
dimensionality axiom ($\gamma$) introduced in § 1.

Since the vector components $x_1, x_2, \cdots$ constitute a denumerable set,
this "Hilbert space" has a denumerably infinite number of dimensions. But in
addition to these, spaces of non-denumerably infinite dimensions may occur.
Consider, for example, all continuous complex functions $\psi(s)$ of a real
variable $s$ of period $2\pi$. We need not distinguish between two values of
$s$ which are congruent mod $2\pi$, i.e. whose difference is an integral
multiple of $2\pi$; it is consequently more convenient to consider $\psi(s)$
as a function defined on the periphery of the unit circle than on the straight
line. The various values of $s$ at points on the circumference play the role
of indices, the value $\psi(s)$ at the point $s$ being the component of the
"vector $\psi$" with index $s$. The totality of such functions therefore
constitutes a linear "function space" of continuously infinite dimensions.
Addition of these vectors and multiplication by a number have here the same
interpretation as in the ordinary operations with functions. The square of the
absolute value of the vector $\psi$ is taken to be

$$(\psi, \psi) = \int_0^{2\pi} \bar{\psi}(s)\,\psi(s)\,ds$$

and the scalar product of two vectors $\phi$ and $\psi$ as

$$(\phi, \psi) = \int_0^{2\pi} \bar{\phi}(s)\,\psi(s)\,ds.$$

A set of functions

$$\phi_1(s), \phi_2(s), \cdots, \phi_n(s)$$

constitutes a unitary-orthogonal system of vectors if

$$\int_0^{2\pi} \bar{\phi}_i(s)\,\phi_k(s)\,ds = \delta_{ik}.$$

These vectors span an $n$-dimensional sub-space $\mathfrak{R}_n$ of the
∞-dimensional function space, i.e. that sub-space consisting of all vectors of
the form

$$\phi(s) = x_1 \phi_1(s) + x_2 \phi_2(s) + \cdots + x_n \phi_n(s).$$

$(x_1, x_2, \cdots, x_n)$ are the components in the co-ordinate system
$\phi_1, \phi_2, \cdots, \phi_n$ of the vector $\phi(s)$ in $\mathfrak{R}_n$.
We have

$$(\phi, \phi) = \int_0^{2\pi} \bar{\phi}(s)\,\phi(s)\,ds = \bar{x}_1 x_1 + \bar{x}_2 x_2 + \cdots + \bar{x}_n x_n.$$

An arbitrary vector $\psi$ can be broken up into a component $\phi$ which lies
in $\mathfrak{R}_n$ and a component $\psi'$ perpendicular to $\mathfrak{R}_n$:

$$\psi(s) = \phi(s) + \psi'(s), \qquad \phi(s) = \sum_{i=1}^{n} x_i \phi_i(s), \qquad \int_0^{2\pi} \bar{\phi}_i(s)\,\psi'(s)\,ds = 0.$$

It follows from these equations that [cf. (4.3)]

$$x_i = \int_0^{2\pi} \bar{\phi}_i(s)\,\psi(s)\,ds.$$

These integrals are called the Fourier coefficients of the function $\psi$
with respect to the orthogonal system $\phi_i$. The orthogonal projection
$\phi$ on $\mathfrak{R}_n$ cannot be longer (i.e. have greater absolute
magnitude) than $\psi$ itself; this is the content of the so-called Bessel
inequality

$$\bar{x}_1 x_1 + \bar{x}_2 x_2 + \cdots + \bar{x}_n x_n \le \int_0^{2\pi} \bar{\psi}(s)\,\psi(s)\,ds. \tag{7.1}$$

In fact, since $(\phi, \psi') = 0$, $(\psi', \phi) = 0$, the "Pythagorean
theorem"

$$(\psi, \psi) = (\phi, \phi) + (\psi', \psi')$$

holds.
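The Bessel inequality can be watched at work on a concrete function. The sketch below is an added illustration with several assumptions: the orthonormal system is taken to be the seven functions exp(ins)/sqrt(2π) for n from -3 to 3, the test function `psi` is s(2π - s), and the integrals are approximated by the midpoint rule with `N` sample points.

```python
import cmath
import math

TWO_PI = 2 * math.pi
N = 4000  # number of midpoint-rule sample points (an arbitrary choice)

def integrate(f):
    """Midpoint-rule approximation of the integral of f over [0, 2*pi]."""
    h = TWO_PI / N
    return h * sum(f((j + 0.5) * h) for j in range(N))

def inner(f, g):
    """Scalar product (f, g): integral of conj(f(s)) * g(s) ds."""
    return integrate(lambda s: f(s).conjugate() * g(s))

def phi(n):
    """Assumed orthonormal system: exp(i*n*s) / sqrt(2*pi)."""
    return lambda s: cmath.exp(1j * n * s) / math.sqrt(TWO_PI)

def psi(s):
    """Assumed continuous periodic test function on [0, 2*pi]."""
    return s * (TWO_PI - s)

# Fourier coefficients x_n = (phi_n, psi) for n = -3, ..., 3.
coefficients = {n: inner(phi(n), psi) for n in range(-3, 4)}

bessel_lhs = sum(abs(x) ** 2 for x in coefficients.values())
bessel_rhs = inner(psi, psi).real
```

For this choice `bessel_lhs` stays below `bessel_rhs` but already accounts for more than 99 per cent of it; the remainder is the squared length of the component perpendicular to the seven-dimensional sub-space.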

The simplest unitary-orthogonal system in the domain of periodic functions,
with which the theory of Fourier series is concerned, consists of the
functions

$$\frac{1}{\sqrt{2\pi}}\,e(ns) \qquad [n = 0, \pm 1, \pm 2, \cdots;\ e(x) = e^{ix}]. \tag{7.2}$$

This infinite system has the property of completeness; it is a complete
co-ordinate system for the entire function space. The theorem that any
periodic function $\psi(s)$ can be expressed as a linear combination of the
functions (7.2):

$$\psi(s) = \frac{1}{\sqrt{2\pi}} \sum_{n=-\infty}^{+\infty} x_n\,e(ns), \qquad x_n = \frac{1}{\sqrt{2\pi}} \int_0^{2\pi} \psi(s)\,e(-ns)\,ds$$

(Fourier expansion of $\psi(s)$) is true only if certain conditions concerning
the differentiability of $\psi(s)$ are fulfilled, but any continuous function
satisfies Parseval's equation

$$\int_0^{2\pi} \bar{\psi}(s)\,\psi(s)\,ds = \sum_{n=-\infty}^{+\infty} \bar{x}_n x_n. \tag{7.3}$$

We learn from this example that there is no essential distinction between
spaces of a denumerable and of a non-denumerable infinitude of dimensions; we
have introduced into our function space a complete normal co-ordinate system
(7.2) consisting of a denumerably infinite set of fundamental vectors. In an
$n$-dimensional unitary space a system of unitary-orthogonal vectors is
complete if their number is $n$, but not if it is less; however, such an
enumeration gives no criterion for ∞-dimensional space. If we leave out a
finite number of the functions (7.2) we still have an infinite set left, but
the completeness of the system is destroyed thereby. The real criterion for
completeness lies in the validity of the completeness relation (7.3).

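We can understand the relations existing in Hilbert space by analogy with or
as limiting cases of those existing in spaces of a finite number of
dimensions. If we consider the values of an arbitrary periodic function
$\psi(s)$ only at the points

$$s = 0,\ \frac{2\pi}{n},\ 2 \cdot \frac{2\pi}{n},\ \cdots,\ (n - 1)\frac{2\pi}{n}$$

and set

$$\psi_\nu = \sqrt{\frac{2\pi}{n}}\;\psi\!\left(\nu\,\frac{2\pi}{n}\right),$$

we are dealing with an $n$-dimensional vector space in which the components of
the arbitrary vector $\psi$ are these quantities $\psi_\nu$
($\nu = 0, 1, \cdots, n - 1$). Let $\mathfrak{e}_\lambda$ be the vector in
this space with components

$$\frac{1}{\sqrt{n}}\,e\!\left(\frac{2\pi\lambda\nu}{n}\right);$$

these vectors ($\lambda = 0, 1, \cdots, n - 1$) constitute a normal
co-ordinate system for the space, relative to which the vector $\psi$ has the
components $x_0, x_1, \cdots, x_{n-1}$ which are to be calculated from

$$\psi_\nu = \frac{1}{\sqrt{n}} \sum_{\lambda=0}^{n-1} x_\lambda\,e\!\left(\frac{2\pi\lambda\nu}{n}\right).$$

In accordance with (4.3)

$$\sum_{\nu=0}^{n-1} \bar{\psi}_\nu \psi_\nu = \sum_{\lambda=0}^{n-1} \bar{x}_\lambda x_\lambda,$$

whence

$$\frac{2\pi}{n} \sum_{\nu=0}^{n-1} \left|\psi\!\left(\nu\,\frac{2\pi}{n}\right)\right|^2 = \sum_{\lambda=0}^{n-1} \bar{x}_\lambda x_\lambda.$$

By passing to the limit $n \to \infty$ we obtain the equation of Parseval. We
do not concern ourselves here with the further considerations which may be
necessary to establish a rigorous proof, but content ourselves with such
reasoning by analogy.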
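The finite-dimensional analogue lends itself to direct computation. The sketch below is an added illustration: it assumes the fundamental vectors have components exp(2πiλν/n)/sqrt(n), which is one explicit form of a normal co-ordinate system of this kind, and the sample vector `y` is arbitrary.

```python
import cmath
import math

def basis_vector(lam, n):
    """Assumed fundamental vector: components exp(2*pi*i*lam*nu/n)/sqrt(n)."""
    return [cmath.exp(2j * math.pi * lam * nu / n) / math.sqrt(n)
            for nu in range(n)]

def scalar_product(a, b):
    """Sum of conj(a_nu) * b_nu."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def components(y):
    """Components x_lam = (e_lam, y) of y in the assumed normal system."""
    n = len(y)
    return [scalar_product(basis_vector(lam, n), y) for lam in range(n)]

def reconstruct(x):
    """Rebuild y as the sum of x_lam * e_lam."""
    n = len(x)
    vectors = [basis_vector(lam, n) for lam in range(n)]
    return [sum(x[lam] * vectors[lam][nu] for lam in range(n))
            for nu in range(n)]

# Arbitrary sample vector with 8 complex components.
y = [complex(math.sin(nu), nu * nu / 10.0) for nu in range(8)]
x = components(y)

length_in_y = sum(abs(v) ** 2 for v in y)
length_in_x = sum(abs(v) ** 2 for v in x)
```

The two squared lengths agree to rounding error, and `reconstruct(x)` returns the original components; this is the finite counterpart of the completeness relation.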

We consider the linear correspondence or "operator"
$D = \dfrac{1}{i}\dfrac{d}{ds}$ which transforms a function $\psi(s)$ in the
domain of periodic functions into $\dfrac{1}{i}\dfrac{d\psi}{ds}$. $e(ns)$ is
the characteristic vector (characteristic function) of this operator belonging
to the characteristic number $n$:

$$\frac{1}{i}\frac{d\,e(ns)}{ds} = n \cdot e(ns).$$

This operator is Hermitian; the scalar product of $\phi$ and $D\psi$ is the
conjugate complex of that of $\psi$ and $D\phi$, where $\phi$ and $\psi$ are
any two periodic functions, for by partial integration

$$\int_0^{2\pi} \bar{\phi}\,\frac{1}{i}\frac{d\psi}{ds}\,ds = -\frac{1}{i}\int_0^{2\pi} \frac{d\bar{\phi}}{ds}\,\psi\,ds$$

and the right-hand side is conjugate to

$$\int_0^{2\pi} \bar{\psi}\,\frac{1}{i}\frac{d\phi}{ds}\,ds.$$

In fact, the Hermitian form

$$\frac{1}{i}\int_0^{2\pi} \bar{\psi}\,\frac{d\psi}{ds}\,ds$$

assumes the normal form

$$\sum_{n=-\infty}^{+\infty} n\,\bar{x}_n x_n \tag{7.4}$$

in the normal co-ordinate system whose fundamental vectors are the
characteristic vectors of the operator $D$. The reiterated operator
$DD = -\dfrac{d^2}{ds^2}$ appears in the theory of the vibrating string,
together with the corresponding Hermitian form

$$-\int_0^{2\pi} \bar{\psi}\,\frac{d^2\psi}{ds^2}\,ds = \int_0^{2\pi} \frac{d\bar{\psi}}{ds}\frac{d\psi}{ds}\,ds$$

which represents the kinetic energy of the string.

We have here been dealing with a discrete spectrum of characteristic numbers.
But in an ∞-dimensional space Hermitian forms with a continuous spectrum can
also be constructed. Consider, for example, the function space consisting of
all continuous functions $\psi(s)$ defined in the interval
$-\pi \le s \le \pi$; the square of the absolute magnitude of the "vector"
$\psi$ is then

$$(\psi, \psi) = \int_{-\pi}^{+\pi} \bar{\psi}(s)\,\psi(s)\,ds.$$

The Hermitian form

$$A[\psi] = \int_{-\pi}^{+\pi} s\,\bar{\psi}(s)\,\psi(s)\,ds \tag{7.5}$$

is already in normal form, which shows that it has as characteristic numbers
all numbers between $-\pi$ and $+\pi$. The functions (7.2) again constitute a
complete normal co-ordinate system in terms of which

$$\psi(s) = \frac{1}{\sqrt{2\pi}} \sum_{n=-\infty}^{+\infty} x_n\,e(ns).$$

Substituting this in (7.5) we find

$$A[\psi] = \sum_{m,n} a_{mn}\,\bar{x}_m x_n, \qquad a_{mn} = \frac{1}{2\pi}\int_{-\pi}^{+\pi} s\,\bar{e}(ms)\,e(ns)\,ds.$$

The evaluation of

$$\frac{1}{2\pi}\int_{-\pi}^{+\pi} s \cdot e[(n - m)s]\,ds$$

yields 0 when $n = m$ and by partial integration

$$\frac{1}{2\pi}\left[\frac{s \cdot e[(n - m)s]}{i(n - m)}\right]_{-\pi}^{+\pi} = \frac{(-1)^{n-m}}{i(n - m)}$$

when $n \neq m$. The Hermitian form

$$\sum_{n \neq m} \frac{1}{i}\,\frac{(-1)^{n-m}}{n - m}\,\bar{x}_m x_n$$

has therefore as characteristic numbers all values between $-\pi$ and $\pi$.
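The coefficients of this Hermitian form can be checked by numerical quadrature. The following sketch is an added illustration: it assumes the normalized functions exp(ins)/sqrt(2π) as co-ordinate system, so that each coefficient is 1/(2π) times the integral of s·exp(i(n-m)s) over the interval from -π to π, and compares a midpoint-rule value with the closed expression (-1)^(n-m) / (i(n-m)). The function names and the number of quadrature steps are choices made here.

```python
import cmath
import math

def a_numeric(m, n, steps=20000):
    """Midpoint-rule value of (1/2pi) * integral of s*exp(i(n-m)s) ds
    over the interval [-pi, pi]."""
    h = 2 * math.pi / steps
    total = 0j
    for j in range(steps):
        s = -math.pi + (j + 0.5) * h
        total += s * cmath.exp(1j * (n - m) * s)
    return total * h / (2 * math.pi)

def a_closed(m, n):
    """Closed form: 0 on the diagonal, (-1)^(n-m) / (i*(n-m)) elsewhere."""
    if m == n:
        return 0j
    return (-1) ** (n - m) / (1j * (n - m))

# Largest deviation between the two over a small block of indices.
max_error = max(abs(a_numeric(m, n) - a_closed(m, n))
                for m in range(-2, 3) for n in range(-2, 3))
```

The deviation `max_error` is at the level of the quadrature error, and `a_closed(m, n)` equals the complex conjugate of `a_closed(n, m)`, as required of the coefficients of a Hermitian form.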

The characteristic vector belonging to the characteristic value $\alpha$
($-\pi \le \alpha \le +\pi$) of $A[\psi]$ is that function which vanishes at
all points $s \neq \alpha$ and is there so large that the integral of
$\bar{\psi}\psi$ has the value 1. Of course such a function does not really
exist, but we can approximate it as closely as we wish. In order to arrive at
a formulation which is mathematically rigorous for the case of continuous
spectra, we must introduce in place of the idempotent Hermitian form
$E_\alpha$ in (5.4) the idempotent form
$\Delta E = \sum_{\alpha \le \lambda < \beta} E_\lambda$ for the entire
interval $\Delta = \Delta_\alpha^\beta$ ($\alpha \le \lambda < \beta$).

For any given vector $\mathfrak{x}$

$$\Delta E(\mathfrak{x}) \ge 0, \qquad \Delta_\alpha^\beta E(\mathfrak{x}) + \Delta_\beta^\gamma E(\mathfrak{x}) = \Delta_\alpha^\gamma E(\mathfrak{x}) \tag{7.6}$$

and the idempotent forms $\Delta E$ associated with two separated intervals
$\Delta$ are mutually independent.

In dealing with the continuum, the sum in (5.4) is replaced by a Stieltjes
integral. Consider the straight line described by the real variable $\lambda$
as being covered with a substance, and let the amount of this substance on the
interval $\Delta$ be denoted by $\Delta m$. We then have, in analogy to (7.6),

$$\Delta m \ge 0, \qquad \Delta_\alpha^\beta m + \Delta_\beta^\gamma m = \Delta_\alpha^\gamma m.$$

If $\phi(\lambda)$ is a continuous function of position we can construct the
integral

$$\int_0^1 \phi(\lambda)\,dm. \tag{7.7}$$

An approximation to this integral can be found by dividing the entire interval
$0 \le \lambda \le 1$ into small intervals $\Delta_i$, choosing a point
$\lambda_i$ in $\Delta_i$ and evaluating the sum
$\sum_i \phi(\lambda_i) \cdot \Delta_i m$. This sum then converges to the
integral on allowing the $\Delta_i$ to approach zero. If the distribution has
a continuous density

$$\rho(\lambda) = \lim_{\Delta \to 0} \frac{\Delta m}{\Delta \lambda}$$

the integral is identical with $\int_0^1 \phi(\lambda)\,\rho(\lambda)\,d\lambda$.
But the Stieltjes integral (7.7) also includes the cases in which there exists
no finite continuous density; in particular, it allows the existence of
discrete points at which a finite amount of the substance is concentrated. If
the substance is distributed over a finite number of points
$\lambda = \alpha_i$ in amounts $m_i$, the Stieltjes integral reduces to the
sum

$$\sum_i m_i\,\phi(\alpha_i).$$
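A Stieltjes sum can be evaluated directly for a distribution that mixes a continuous density with a concentrated mass. The sketch below is an added illustration; the particular distribution (unit density on the interval from 0 to 1 plus a mass of 0.5 at the point 0.25), the integrand, and the function names are assumptions chosen for the example.

```python
def mass_below(lam):
    """Amount of substance on the half-open interval [0, lam):
    unit density plus an assumed point mass 0.5 located at 0.25."""
    return lam + (0.5 if lam > 0.25 else 0.0)

def stieltjes_sum(phi, M, pieces):
    """Approximate the Stieltjes integral of phi over [0, 1] by the sum
    of phi(midpoint) * (mass on the sub-interval)."""
    total = 0.0
    for i in range(pieces):
        a = i / pieces
        b = (i + 1) / pieces
        total += phi((a + b) / 2) * (M(b) - M(a))
    return total

def phi(lam):
    """Continuous function of position used as integrand."""
    return lam * lam

approx = stieltjes_sum(phi, mass_below, 10000)

# Density part: integral of lam^2 over [0, 1] = 1/3.
# Point-mass part: 0.5 * phi(0.25).
expected = 1.0 / 3.0 + 0.5 * phi(0.25)
```

The value `approx` agrees with `expected` to about four decimal places at this subdivision; the density contributes an ordinary integral and the point mass contributes a single term of the discrete sum.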

We thus arrive at the following more inclusive formulation of the fundamental
theorem concerning the transformation to principal axes: (1) The Hermitian
form $A$ associates with each interval $\Delta$ an idempotent form
$\Delta E$; (2) when two adjacent intervals $\Delta_1$, $\Delta_2$ are added
together to form an interval $\Delta$,

$$\Delta E = \Delta_1 E + \Delta_2 E,$$

and the idempotent forms associated with separated intervals are independent;
(3) we have

$$\mathfrak{x}^2 = \int_{-\infty}^{+\infty} dE(\lambda; \mathfrak{x}), \qquad A(\mathfrak{x}) = \int_{-\infty}^{+\infty} \lambda\,dE(\lambda; \mathfrak{x}).$$

In this form the theorem is adapted to the appearance of continuous spectra of
characteristic numbers, and is particularly appropriate for the purposes of
quantum mechanics (cf. II, § 7). The discrete characteristic numbers lie at
those points where the monotonic increasing function
$\Delta_{-\infty}^{\lambda} E(\mathfrak{x}) = E(\lambda; \mathfrak{x})$ of
$\lambda$ has a discontinuity. In our example (7.5)

$$E(\lambda; \psi) = \int_{-\pi}^{\lambda} \bar{\psi}(s)\,\psi(s)\,ds;$$

$\psi$ here must be taken as 0 outside the interval $(-\pi, +\pi)$. The
evaluation in terms of the co-ordinates $x_n$ is readily accomplished.

Consider the function space consisting of the totality of all functions
$\psi(s)$ of a variable $s$, which assumes all values from $-\infty$ to
$+\infty$, and which have a finite absolute magnitude

$$(\psi, \psi) = \int_{-\infty}^{+\infty} \bar{\psi}(s)\,\psi(s)\,ds,$$

i.e. which are "integrable square." The characteristic functions associated
with the linear correspondence
$\psi(s) \to \dfrac{1}{i}\dfrac{d\psi}{ds}$ are again the functions
$e(\nu s)$, but the frequency $\nu$ can now assume all real values. The
components of $\psi(s)$ are the quantities

$$f(\nu) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} \psi(s)\,e(-\nu s)\,ds.$$

Fourier's integral theorem then allows us to conclude the validity of the
expansion

$$\psi(s) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} f(\nu)\,e(\nu s)\,d\nu$$

under certain assumptions concerning the differentiability of the function
$\psi(s)$; but in any case the completeness relation

$$\int_{-\infty}^{+\infty} \bar{\psi}(s)\,\psi(s)\,ds = \int_{-\infty}^{+\infty} \bar{f}(\nu)\,f(\nu)\,d\nu$$

is valid. We arrive at a somewhat different problem when we only require that
the functions $\psi(s)$ be such that $\bar{\psi}(s)\psi(s)$ possess a definite
mean value

$$\lim_{a \to \infty} \frac{1}{2a} \int_{-a}^{+a} \bar{\psi}(s)\,\psi(s)\,ds = (\psi, \psi);$$

this leads to the theory of almost-periodic functions developed by H. Bohr.
Here again the validity of the completeness relation can be established.

The theory of the characteristic numbers of Hermitian forms in infinitely many
variables has been developed by Hilbert and Hellinger, but it is applicable
only to bounded forms

$$A(\mathfrak{x}) = \sum_{i,k} a_{ik}\,\bar{x}_i x_k,$$

i.e. forms whose values have a fixed upper bound when

$$\sum_i \bar{x}_i x_i \le 1. \tag{7.8}$$

Indeed, without this assumption we cannot guarantee the convergence of
$A(\mathfrak{x})$ in the entire domain (7.8); as an example consider the form
(7.4), $\sum_n n\,\bar{x}_n x_n$. That this form only converges in a portion
of the domain (7.8) is merely another expression of the fact that not every
continuous function is differentiable. The situation is more favourable for
unitary forms as they satisfy the condition that they be "bounded" in
consequence of their very definition; a unitary transformation is thereby to
be taken as satisfying both of the conditions

$$\tilde{U}U = 1, \qquad U\tilde{U} = 1,$$

$\tilde{U}$ being the Hermitian conjugate of $U$.

The theorem on principal axes has been proved rigorously for bounded Hermitian
and for unitary correspondences in ∞-dimensional space. A method due to
A. Wintner seems particularly appropriate for dealing with unitary
correspondences; it is based on the consideration of the discrete group of all
powers $U^n$ of the given unitary transformation $U$, and determines the
monotonic increasing function $E(\lambda; \mathfrak{x})$ of the real variable
$\lambda$ ($0 \le \lambda \le 2\pi$) by means of the equations

$$U^n(\mathfrak{x}) = \int_0^{2\pi} e^{in\lambda}\,dE(\lambda; \mathfrak{x}) \tag{7.9}$$

(the problem of trigonometric moments). J. v. Neumann has gone furthest in
dealing with linear operators for which boundedness is not postulated. In
accordance with § 6 with a Hermitian form $A$ is associated a group of unitary
correspondences $U(\tau)$ depending on the real parameter $\tau$ and
satisfying the equation

$$U(\tau + \tau') = U(\tau)\,U(\tau'); \tag{7.10}$$

the study of this group is equivalent to the study of $A$. It is therefore
perhaps appropriate to replace this latter for ∞-dimensional space by the
former, for no convergence difficulties appear in the domain of unitary
transformations. We must therefore attempt to bring the operators $U(\tau)$,
which are continuous functions of the real parameter $\tau$ satisfying (7.10),
simultaneously into the form

$$U(\tau; \mathfrak{x}) = \int_{-\infty}^{+\infty} e^{i\tau\lambda}\,d_\lambda E(\lambda; \mathfrak{x}). \tag{7.11}$$

This is accomplished with the aid of Wintner's method on replacing the
discrete parameter $n$ in (7.9) by the continuous parameter $\tau$. The
problem (7.11) bears the same relation to (7.9) as Fourier's integral bears to
Fourier series.

In setting up a system of axioms for ∞-dimensional vector space the axioms
($\alpha$), ($\beta$) of § 1 and the metric axiom ($\delta$) of § 4 can be
retained; for the proper substitute for the dimension axiom ($\gamma$) see,
e.g., v. Neumann, "Mathematische Begründung der Quantenmechanik."

The algebraic and geometric tools developed in this chapter offer a natural
medium for the expression of quantum mechanics; they already hold a
dominating position in the classical physics of continuous media. A masterly
exposition of their mathematical content and application is found in the first
part of Courant-Hilbert's "Methoden der mathematischen Physik," 2nd ed.
(Berlin, 1930).



CHAPTER II 


QUANTUM THEORY 


§ 1. Physical Foundations


T he magic formula 

E — hv 


( 1 . 1 ) 


from which the whole of quantum theory is developed, establishes 
a universal relationship between the frequency v of an oscillatory 
process and the energy E associated with such a process. The 
quantum of actio'n h is one of the universal constants of nature 


h = 6-547 X 10”^’ erg secs. 


It was first discovered by Planck at the turn of the century in 
the laws of black body radiation ; that is, radiation which is 
enclosed in a cavity and is in thermodynamic equilibrium with 
matter of a definite temperature, which by emission and ab- 
sorption causes an exchange of energy between the various 
frequencies contained in the radiation. Since this equilibrium 
is independent of the particular nature of the matter involved, 
Planck considered, as a kind of schematic matter, a system of 
linear oscillators of all possible frequencies. A charge oscillating 
with frequency $\nu$ interacts with the electromagnetic field by emitting
and absorbing radiation of the same frequency. Planck assumed
that the exchange of energy took place in integral multiples
of an energy quantum $\epsilon$; he at first considered this assumption
merely as a mathematical device, and intended to pass to the
limit $\epsilon \to 0$. In order to obtain agreement with the Wien
displacement law, which was derived from general thermodynamical
principles, the energy quantum associated with a
definite frequency $\nu$ must be taken proportional to $\nu$: $\epsilon = h\nu$.
In this way Planck obtained his radiation formula, which is in
excellent accord with observation; according to it the amount
of energy contained per unit volume in the spectral interval
$\nu$, $\nu + d\nu$ in thermodynamic equilibrium at temperature $\theta$ is


$$u(\nu)\,d\nu = \frac{8\pi h\nu^3}{c^3}\,\frac{d\nu}{e^{h\nu/k\theta} - 1} \tag{1.2}$$

where $c$ is the velocity of light and $k$ the Boltzmann constant
($\tfrac{3}{2}k\theta$ being the mean energy of an atom of a monatomic gas at
temperature $\theta$). On passing to the limit $h = 0$ we obtain the
Rayleigh-Jeans radiation law

$$u(\nu)\,d\nu = \frac{8\pi\nu^2 k\theta}{c^3}\,d\nu.$$

The assumption of the validity of this latter law for the entire
spectrum is in gross disagreement with the facts, as it would
lead to an infinite value for the total energy $\int u(\nu)\,d\nu$; a state of

equilibrium would therefore be impossible with given finite 
energy. 
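The contrast between the two radiation laws can be checked numerically. The following is a minimal sketch, not part of the original text: the value of h is the one quoted above, while the values of c and k, the temperature and the function names are assumptions chosen for illustration.

```python
import math

# Assumed CGS constants: H is the value quoted in the text,
# C and K are modern figures inserted for illustration only.
H = 6.547e-27   # erg sec
C = 2.998e10    # cm / sec
K = 1.381e-16   # erg / degree

def planck(nu, theta):
    """Energy density per unit frequency according to formula (1.2)."""
    return 8 * math.pi * H * nu**3 / C**3 / math.expm1(H * nu / (K * theta))

def rayleigh_jeans(nu, theta):
    """Limit h -> 0 of formula (1.2)."""
    return 8 * math.pi * nu**2 * K * theta / C**3

def total_energy(u, theta, nu_max, steps=20000):
    """Midpoint-rule integral of u(nu) from 0 to nu_max."""
    d = nu_max / steps
    return sum(u((j + 0.5) * d, theta) for j in range(steps)) * d
```

Doubling the upper limit of integration leaves the Planck total practically unchanged, whereas the Rayleigh-Jeans total grows by a factor of eight, which is the divergence described above.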

The idea of a quantized exchange of energy, which occurs 
in Planck’s derivation somewhat schematically and only in 
application to statistical thermodynamical consequences, was 
first seriously applied to individual atomic processes by Einstein. 
In 1905, guided by the observations of H. Hertz, Hallwachs 
and Lenard on the photo-electric effect, he enunciated the idea 
of a light quantum or photon as “ an heuristic viewpoint con- 
cerning the generation and transformation of light ” * according 
to which not only the exchange of energy between matter and 
radiation of frequency $\nu$ occurs in quanta of amount $h\nu$, but
further, light of frequency $\nu$ can exist in the ether only in quanta
of energy $h\nu$. The decisive experiments were first performed
by Millikan ten years later. By allowing ultra-violet or X-radiation
of frequency $\nu$ to fall on a metal plate electrons are
released whose kinetic energy (as was already known to Lenard) 
increases with the hardness (i.e. with decrease of wave-length) 
of the incident radiation : the energy with which the electrons 
are emitted is, however, not influenced by the intensity of the 
radiation. The exact relation predicted by Einstein is

$$\tfrac{1}{2}mv^2 = h\nu - P,$$

where $-e$, $m$ and $v$ are the charge, mass and velocity of the
electron, respectively. The energy $h\nu$ of the photon is transformed
into kinetic energy of the electron, after subtracting
from it the work $P$ required to pull the electron out of the metal
surface. If the potential difference between the metal surface
and a plate placed in front of it is $V'$ the electron current will
disappear as soon as $V'$ exceeds the critical value $\frac{h\nu - P}{e}$.

Millikan found that the potential at which the current vanished,
obtained by extrapolation, was in fact exactly proportional to
the frequency $\nu$ for monochromatic light of various frequencies,
and that the constant of proportionality was equal to the
quotient of the $h$ obtained by Planck from black body radiation
and the elementary quantum of electric charge $e$. The difference
of the mean energy $P$ for two different metals is furthermore
equal to $e$ times their contact difference of potential. The
value of $P$, or at least its order of magnitude, is therefore known,
and we find that for X-rays of a few Ångströms wave-length
(1 Å = $10^{-8}$ cm.) $P$ is negligible in comparison with $h\nu$. The
equation

$$h\nu = eV \tag{1.3}$$


governs not only the generation of secondary cathode rays by 
primary X-rays, but also the inverse process : the transformation 
at the glass wall or on the anode of the incident cathode rays 
into the impulse radiation first observed by Röntgen. If an
electron which has run through the potential drop $-V$ in the
X-ray tube loses its entire energy on collision, a photon of frequency
$\nu$ and energy $h\nu = eV$ will spring into existence. The
electron may, however, only be slowed down; consequently
$\nu$ is only the upper limit for the frequency of the impulse radiation,
which will therefore consist of a continuous spectrum with
a sharp limit at $\nu = \frac{eV}{h}$. The old classical theory of radiation


was entirely unable to account for this most characteristic 
property of the impulse radiation. The frequency of the limit 
increases in proportion with the applied potential — and this is 
the exact formulation of the fact that “ the higher the potential, 
the harder the rays ” so familiar to every X-ray operator. 
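The sharp limit of the continuous spectrum is easily evaluated. The sketch below is only an illustration: it uses modern SI values for h, c and e rather than the figures quoted in the text, and the function names are hypothetical.

```python
# Assumed modern SI constants (the text itself quotes h in erg sec).
PLANCK_H = 6.626e-34   # J s
LIGHT_C = 2.998e8      # m / s
CHARGE_E = 1.602e-19   # C

def limiting_frequency(volts):
    """Upper frequency limit nu = eV / h of the impulse radiation."""
    return CHARGE_E * volts / PLANCK_H

def limiting_wavelength_angstrom(volts):
    """Shortest wave-length c / nu, expressed in Angstrom units."""
    return LIGHT_C / limiting_frequency(volts) * 1e10
```

With these assumed constants a tube operated at 30,000 volts has its limit near 0.41 Å, and doubling the potential halves the limiting wave-length.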

The observed phenomena thus confirm the hypothesis that 
radiation of frequency $\nu$ can be absorbed and emitted only in
quanta of energy $h\nu$. This hypothesis will of course have further
consequences for the theory of the structure of matter. The 
Planck oscillator will, for example, be unable to alter its energy 
continuously since it can only emit or absorb these fixed quanta 
of energy, and it will consequently spring to and fro on the rungs 
of its energy ladder, which are equally spaced at intervals $h\nu$;
$\nu$ is here the frequency of the oscillator, a constant determined by
the constitution of the oscillator. An application of the essential 
elements of this idea to actual atoms gave rise to the frequency 
rule enunciated by Niels Bohr (1913) : 

An atom can exist only in certain discrete stationary states
(“quantum states”) in which it does not radiate. Light will be
emitted on transition from one state into another; the energy which
it loses in this transition, the difference $E_i - E_k$ of its energy in
the two states, will be transformed into a photon of energy $h\nu$, the
frequency $\nu$ of which is determined by the equation

$$h\nu = E_i - E_k. \tag{1.4}$$

In this equation $E_i$, $E_k$ may be any two of the discrete energy levels
($E_i > E_k$). Conversely, in absorption a photon raises the atom from
the energy level $E_k$ to a higher $E_i$ by giving up its energy $h\nu$ to the
atom.

According to classical electrodynamics an atom should 
continually emit radiation in consequence of the vibrations of 
its constituent electrons, and the frequencies of the emitted 
light should agree with the frequencies of the simple oscillations 
into which the motion of its electronic system can be resolved. 
But the atom will itself lose energy through this radiation, the 
motion of its electrons will thereby be modified and the fre- 
quencies will consequently be displaced. This entire point of 
view is therefore irreconcilable with one of the most fundamental 
physical facts : the existence of sharp spectral lines. On the 
other hand, Bohr’s assumption is not only in agreement with 
this fact, although it offers no such detailed picture of the 
reaction between matter and ether as the classical theory, but 
contains in addition the fundamental Ritz-Rydberg combination
principle. If we order the energy levels in an increasing series
$E_0 < E_1 < E_2 < \cdots$, then in accordance with (1.4) each
frequency $\nu$ is the difference of two “terms” $\nu_i = E_i/h$,

$$\nu(i \to k) = \nu_i - \nu_k \qquad (i > k).$$

Consequently there will occur in addition to the frequencies $\nu(i \to k)$,
$\nu(k \to l)$ the frequency

$$\nu(i \to l) = \nu(i \to k) + \nu(k \to l) \tag{1.5}$$

obtained from them by addition. This combination principle is 
valid without exception in the whole of spectroscopy, in the 
optical region as well as in that of X-rays, and has proved to 
be a valuable guide in the classification of spectra ; it reduces 
the complex line spectra to the simpler term spectra. Unfortunately
the problem is made more difficult by the fact that
not all lines corresponding to possible transitions $i \to k$ need
actually occur (not every term $\nu_i$ need “combine” with a
given term $\nu_k$), for the conditions of excitation may be such
that certain lines have zero intensity. The selection rules for 
the allowable transitions will therefore be contained in the 
rules which determine the intensities of spectral lines. The 
combination principle, or the Bohr frequency rule, determines,
so to speak, only the keyboard of the spectrum; which tones
are really struck is dependent on the mode of excitation. But 
it will in general be possible under proper conditions of ex- 
citation, e.g. the influence of strong external electric fields, to 
bring out the lines which are not observed under ordinary 
conditions. 

In the “unexcited” or normal state the atom is in the stationary
state of lowest energy $E_0$, and consequently only the lines of the
“series” $n \to 0$, of frequency $\nu_n - \nu_0$ ($n = 1, 2, \cdots$), occur in
absorption. The lowest of these, $1 \to 0$ (i.e. with greatest wave-length),
or more precisely the lowest which is not forbidden by
the selection rules, is called the “resonance line.”

The simplest atom is that of hydrogen; in it a single electron
of charge $-e$ revolves about a nucleus of opposite charge $+e$.
The terms of the spectrum of atomic hydrogen are found by
observation to be given by the equation

$$\frac{\nu_n}{c} = -\frac{R}{n^2} \qquad (n = 1, 2, 3, \cdots),$$

where $R = 109700$ cm$^{-1}$ is the Rydberg constant (spectroscopists
are accustomed to give the wave number $\nu/c$, the reciprocal wave-length,
instead of the frequency $\nu$). The energy levels corresponding
to these frequency terms are $E_n = -\frac{Rhc}{n^2}$. To this
discrete term spectrum we must add the continuous spectrum
$E \geq 0$; the additive constant in the energy is so chosen that
$E = 0$ separates the hyperbolic electron orbits from the elliptic.
The Balmer series consists of the lines $n \to 2$ with wave numbers

$$R\left(\frac{1}{2^2} - \frac{1}{n^2}\right) \qquad (n = 3, 4, \cdots).$$

This is the oldest known series formula; Balmer obtained it in
1886 by abstraction from the first four lines of the series, called
$H_\alpha$, $H_\beta$, $H_\gamma$, $H_\delta$, which lie in the visible region. The lines of
this series converge with increasing $n$ to a limit with wave
number $\frac{R}{4}$ (wave-length $\frac{4}{R}$); $\frac{Rhc}{4}$ is the work required
to ionize an H-atom in the stationary state $n = 2$, i.e. the work
required to remove the electron from such an atom without
leaving it with kinetic energy. The continuous spectrum,
arising from transitions which ionize the atom, will join on to 
this series limit on the short wave side. We are further acquainted
with the Lyman series $n \to 1$ which lies in the ultra-violet
and also occurs in absorption, the Paschen series $n \to 3$
lying in the infra-red, and finally with some members of the
Brackett ($n \to 4$) and Pfund ($n \to 5$) series in the far infra-red.
In order to ionize hydrogen in the normal state an amount $cRh$
of work must be done; the corresponding “ionization potential,”
i.e. the potential difference an electron must traverse before it
is able to ionize atomic hydrogen by means of its kinetic energy, is

$$V = \frac{cRh}{e} = 13.53 \text{ volts}.$$
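The term scheme of hydrogen lends itself to a short numerical illustration. The sketch below assumes only the Rydberg constant quoted in the text; the function names are hypothetical and all quantities are wave numbers in cm⁻¹.

```python
R = 109700.0  # Rydberg constant in cm^-1, as quoted in the text

def term(n):
    """Spectral term of level n as a (negative) wave number."""
    return -R / n**2

def wave_number(n_upper, n_lower):
    """Wave number of the line n_upper -> n_lower as a difference of terms."""
    return term(n_upper) - term(n_lower)

def wavelength_angstrom(n_upper, n_lower):
    """Wave-length of the line in Angstrom units (1 cm = 1e8 Angstrom)."""
    return 1e8 / wave_number(n_upper, n_lower)

def series_limit_angstrom(n_lower):
    """Wave-length to which the series n -> n_lower converges."""
    return 1e8 / (R / n_lower**2)
```

The first Balmer line, `wavelength_angstrom(3, 2)`, falls in the red at about 6563 Å, the series limit lies near 3646 Å, and the combination principle (1.5) holds identically because every line is a difference of two terms.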

Bohr’s frequency rule goes beyond the combination principle 
in asserting that the terms are actually energy levels, an assertion 
irrelevant to and not verifiable by spectroscopy. That this is, 
however, in fact the case is confirmed by the experiments of 
Franck and Hertz on collision phenomena.^ In these experiments 
electrons are given an amount eV of kinetic energy by allowing 
them to pass through an electric field of known potential differ- 
ence — V and are then allowed to pass through a gas consisting 
of the atoms which are to be investigated with the velocity thus 
obtained, without further influence from external fields. The 
electron can give up no energy to the atom until $eV$ is greater
than the excitation energy $E_1 - E_0$ of the resonance line; if

$$E_1 - E_0 < eV < E_2 - E_0$$

then the electron can either suffer an “elastic collision,” in
which case it loses no energy, or it can suffer an “inelastic
collision,” in which case it loses an amount $E_1 - E_0$ to the
atom. The electrons which have passed through the gas are
of two kinds, those with kinetic energy $eV$ and those with
$eV - (E_1 - E_0)$. When the atoms which have been raised
from the state 0 to the state 1 by collision with electrons fall
back into the normal state they emit the resonance line and,
under the above conditions, only this line. This is fully confirmed
by the experiment. The kinetic energy of the emerging
electrons is measured by introducing a retarding potential $V'$;
the electrons only come through it if their energy is greater
than $eV'$. In general the electrons possess a discrete “energy
spectrum” after collision with an atom of the gas; the possible
energy values are

$$eV'_n = eV - (E_n - E_0)$$

($n = 0, 1, 2, \cdots$, in so far as $V'_n$ is still positive; we here disregard
the possibility that a single electron may suffer more than
one inelastic collision). On allowing the retarding potential $V'$
to decrease gradually from a value which is greater than $V$ the
electron current decreases suddenly whenever $V'$ passes through
one of the values $V'_0, V'_1, \cdots$.

Bohr’s frequency rule reduces the determination of spectra 
to the problem of obtaining the stationary states and the correspond- 
ing energy levels of an atom, i.e. of a mechanical system of known 
dynamical constitution. The example of the linear oscillator 
given above and the fundamental notions of the theory of 
oscillations suggest the following as a general guiding principle 
(P) : the frequencies derived from the energy levels by means 
of Bohr’s frequency rule shall correspond to the frequencies of 
the simple vibrations into which the actual motion of the atomic 
constituents can be resolved in accordance with the laws of 
dynamics. Such a resolution into simple oscillations is con- 
vincingly attainable in classical mechanics only if the system 
is “ multiply ” or “ conditionally periodic,” and for this case it 
was actually found possible to sharpen the general principle (P)
into a definite rule for quantization. In the years 1913-25 the 
application of this quantum rule yielded a great harvest of 
results, and it seemed that we were in possession of the key that 
would unlock the mysteries of atomic processes. But the wards 
did not quite fit ; toward the end of this epoch its failure became 
more and more apparent and the physical theory was gradually 
reduced to a symbolic calculus of quantum numbers which had 
to be corrected each time a new fact was discovered. We do 
not wonder now that it ran such a course, but rather are surprised 
that it was as successful as it was ! 

From the beginning the quantum rules were a compromise. 
If a mechanical system of one degree of freedom undergoes a 
periodic motion the frequencies $\nu$ of the simple vibrations into
which its motion can be resolved are integral multiples of a 
fundamental frequency $\omega$. This frequency depends on the
energy of the orbit under consideration, and this latter is restricted
by the quantum rules to the discrete set $E_n$. The
internal frequencies of the motion are therefore given by the
formula

$$\nu = k \cdot \omega(n) \tag{1.7}$$

which depends on the two integers $n$ and $k$. By the analogy
with quantum mechanical frequencies this internal frequency
(1.7) is to be ascribed to the jump $n \to (n - k)$. The fact that
$\nu$ depends linearly and homogeneously on the jump $k$ is expressed
by the “classical combination principle”

$$\nu(n \to n - k) + \nu(n \to n - l) = \nu(n \to n - k - l) \tag{1.8}$$




in consequence of which frequencies with the same initial state 
n will combine. But this is not in accord with the correct 
combination principle 

$$\nu(n \to n - k) + \nu(n - k \to n - k - l) = \nu(n \to n - k - l). \tag{1.9}$$

The changes $k$, $l$ in the quantum number are here the same as
in (1.8), but the final state $n - k$ of the first frequency coincides
with the initial state of the second ; only for quantum numbers 
n which are large compared with k and I does the classical 
principle agree asymptotically with the Ritz-Rydberg com- 
bination principle. Consequently if the general principle (P) 
is to be satisfied without compromise our mechanics must be 
altered in such a way that the false combination principle (1.8) 
is replaced by the correct one (1.9). In 1925 Heisenberg dis- 
covered a way in which such an alteration can be naturally 
accomplished ; in order to do this, however, it was necessary 
to give up the picture of an atom with its electronic orbits. 
The quantities with which the Heisenberg theory deals are 
only the frequencies and intensities of radiation associated with 
transitions between the various states of the atom. 

It should be observed that the correct combination principle 
(1.9) is in one important respect simpler than the false one (1.8). 
As the formulation 

$$\nu(n'' \to n') + \nu(n' \to n) = \nu(n'' \to n) \tag{1.10}$$

shows, the quantum numbers serve only as distinguishing marks 
or indices which do not involve a law of composition, whereas 
the classical formula requires the addition of quantum numbers, 
which are therefore numbers on a definite scale. 
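The asymptotic agreement of the two combination principles can be made concrete with the hydrogen terms. In the sketch below the quantum frequencies are differences of the terms $-R/n^2$; the classical fundamental frequency is taken as the derivative of the term with respect to $n$, i.e. $2R/n^3$, which is an assumption made in the spirit of the correspondence argument and not a formula stated in the text. All names are hypothetical.

```python
R = 109700.0  # Rydberg constant in cm^-1, as quoted earlier in the text

def quantum(n, m):
    """Wave number of the jump n -> m (n > m) as a difference of terms."""
    return R * (1.0 / m**2 - 1.0 / n**2)

def classical(n, k):
    """k-th harmonic of the assumed classical fundamental 2R / n^3."""
    return k * 2.0 * R / n**3

def ratio(n, k):
    """Quantum frequency of n -> n - k divided by its classical analogue."""
    return quantum(n, n - k) / classical(n, k)
```

For small quantum numbers the two differ widely (`ratio(5, 1)` is about 1.4), while for large ones they approach each other (`ratio(1000, 1)` differs from 1 by less than one per cent). The classical frequencies add with a common initial state as in (1.8), the quantum ones chain as in (1.9).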

Another approach to quantum mechanics was discovered 
by L. de Broglie and E. Schrödinger.* This approach seems to
me less cogent, but it leads more quickly to the fundamental 
principles of quantum mechanics and to the most important 
consequences for experimental science. We shall therefore 
follow it, since we are more concerned in giving a short but 
comprehensive account than in giving a complete discussion of 
the physical foundations. The physical, essentially statistical, 
interpretation of the theory, with which Schrödinger has not
been entirely in accord, is due mainly to M. Born. 

§ 2. The de Broglie Waves of a Particle 

We consider the undulatory character of light as guaranteed 
by the phenomena of diffraction and interference. Their most 
decisive feature is that with them we are dealing with the linear
superposition of waves with arbitrary differences of phase. From
the mathematical standpoint, they are characterized by the fact
that they involve addition and multiplication with complex
numbers, and we are consequently dealing with vectors in a
complex space. We can, in fact, consider a complex function
$\psi(t; xyz)$ employed in the description of the phenomena and
extended over time and space as such a vector, where each space-time
point represents one dimension of a complex vector space;
the differential laws for such a wave function $\psi$ (or for several
such functions simultaneously, such as the components of the
electric and magnetic field strengths) are linear and homogeneous.
But on the other hand the quantum phenomena
which we discussed above speak just as plainly in favour of
the corpuscular nature of light. The intensity of the monochromatic
radiation employed in the production of the photo-electric
effect has no influence on the velocity with which the
electrons leave the metal; it influences only the frequency of
the event. Even with intensities so weak that on the classical
theory hours would be required before the electromagnetic
energy passing through a given atom would attain to an amount
equal to that of a photon, the effect begins immediately, the
points at which it occurs being distributed irregularly over the
entire metal plate. This constitutes a proof of the existence of
photons which is no less direct than the proof that $\alpha$-particles are
of corpuscular nature by observing the scintillations caused by
them on striking a sensitized screen. Further, if one considers
the exchange of momentum in addition to that of energy in
deriving the laws of black body radiation, conflict with Planck’s
hypothesis concerning energy quanta can be avoided only by
assuming that in addition to the emission of the energy quantum
$h\nu$ a quantum $h\nu/c$ of momentum is emitted in a definite direction,
producing an equivalent reaction on the atom.® We here replace
the continuous radiation of a spherical wave by the discontinuous
emission of photons in definite directions which are irregularly
distributed over the compass.

We unite the two standpoints by retaining the linear wave
equation, but considering the intensity as the relative probability
that the photon appears at the point $(x, y, z)$ at time $t$; or, more
precisely, that

$$\bar{\psi}\psi\,dx\,dy\,dz \tag{2.1}$$

is the probability that at time $t$ it will be found within the small
parallelepiped with sides of length $dx$, $dy$, $dz$ about the point
$(x, y, z)$.* But we can only expect to arrive at a rational theory
if we deal with material particles in the same way as with photons. 
This point of view was developed in the Bose-Einstein treatment 
of an atomic gas, which paralleled that employed in the theory 
of black body radiation (“light quant gas”).® Schrödinger's
researches took as their point of departure the Hamiltonian 
theory of mechanics, which was originally obtained by Hamilton 
himself from an analogy with geometrical optics. He argued 
that since we replace geometrical optics, with the aid of which 
interference and diffraction cannot be treated, by wave optics, 
it is reasonable to attempt the analogous transition in mechanics. 
The results amply justified the attempt. The investigations of 
Davisson and Germer^ which prove the existence of interference 
in beams of electrons reflected from a crystal lattice, were already 
in progress when de Broglie published his theory. The experi- 
mental evidence that moving material particles behave in much 
the same way as a beam of light with respect to these phenomena 
is now fully established, and with no less certainty than for 
X-rays, by a series of further investigations by the same 
authors and by G. P. Thomson, F. Rupp and others.’ The
real difference between “light-like” and “electron-like” beams
lies in the fact that the particles composing the latter possess 
charge and proper mass and can consequently be deflected by 
electric and magnetic fields. 

A simple oscillation is one in which the function $\psi$, defining
the state of the system, depends on the time in accordance with
the law

$$a \cdot e^{-i\nu t} \tag{2.2}$$

where $a$ and $\nu$ are independent of $t$. [We choose as our unit
of angular measure that one which proves most useful in differential
calculus, for it yields the simple relation

$$\frac{1}{i}\frac{de(x)}{dx} = e(x) \tag{2.3}$$

for the fundamental trigonometric function $e^{ix} = e(x)$. The
sum of the angles about a point is then $2\pi$; it would, admittedly,
be more correct from the integral standpoint to take this as 1,
but then the factor $2\pi$ would appear in the differential relation.
$\nu/2\pi$ is the number of oscillations in unit time; we shall not


* Just as in the classical wave theory we have an expression for the flow
of energy in addition to its density, so in the more refined formulation of
quantum theory we will have an expression for the probability that the
photon passes through a given element of surface (“probability current”) in
addition to one for the probability that it be found in a given element of
volume (“probability density”).

hesitate, however, to use the name “frequency” for $\nu$. If we
denote Planck's constant of action by $2\pi h$ instead of $h$, as we shall
throughout the present work, the fundamental formula (1.1)
will still be valid in the new nomenclature.] In accordance with
(2.3) the simple oscillations (2.2) are the characteristic functions
of the linear Hermitian operator which carries $\psi$ over into
$-\frac{h}{i}\frac{\partial\psi}{\partial t}$; the corresponding characteristic numbers are the
energies $E = h\nu$. If the dependence of a state of the system on
time is described by a superposition of simple oscillations

$$\psi(t) = a_1 e^{-i\nu_1 t} + a_2 e^{-i\nu_2 t} + \cdots, \tag{2.4}$$

the energy is capable of assuming only one of the values
$h\nu_1$, $h\nu_2$, $\cdots$, and we shall take the intensity $\bar{a}_r a_r = |a_r|^2$ of the
oscillation of frequency $\nu_r$ in $\psi$ as the relative probability that
the energy is observed to be $h\nu_r$. The relation $E = h\nu$ is accordingly
to be interpreted: if $\nu$ is indeterminate because an entire
spectrum of frequencies $\nu$ is contained in the oscillatory process, then
the energy is indeterminate to the same extent; the intensities
with which the various simple oscillations occur in the process
measure the probabilities of the corresponding energies. The
operator $-\frac{h}{i}\frac{\partial}{\partial t}$ represents the energy:

$$H \to -\frac{h}{i}\frac{\partial}{\partial t} \tag{2.5}$$

in the following sense: a characteristic function of (2.5) represents
a state in which the energy assumes a definite value $E$ with certainty.
This value is the corresponding characteristic number; in an
arbitrary state the components $a$ of $\psi$ with respect to these characteristic
functions determine the relative probabilities $\bar{a}a$ of these
values $E$.
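The statistical rule just stated amounts to a small computation. The sketch below is an illustration under stated assumptions: the frequencies are taken to be distinct, the unit $h = 1$ is arbitrary, and the function name is hypothetical.

```python
def energy_distribution(amplitudes, frequencies, h=1.0):
    """Map each energy h * nu_r to its probability |a_r|^2 / sum |a_s|^2.

    Assumes the frequencies are distinct; h = 1 is an arbitrary unit.
    """
    weights = [abs(a) ** 2 for a in amplitudes]
    total = sum(weights)
    return {h * nu: w / total for nu, w in zip(frequencies, weights)}
```

For the superposition with components $a_1 = 1$, $a_2 = i$, $a_3 = 2$ the three energies occur with probabilities 1/6, 1/6 and 2/3; the phases of the components play no part.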

According to the theory of relativity energy is to be considered
as the time component of a 4-vector whose spatial components
constitute the linear momentum $p = (p_x, p_y, p_z)$. The
fundamental metric invariant of the two vectors running from
the origin to the points $(t, xyz)$, $(t', x'y'z')$ is the scalar product

$$c^2 tt' - (xx' + yy' + zz').$$

Under a Lorentz transformation, which transforms from one
space-time co-ordinate system to another equally permissible
one, the quantities

$$c^2 t,\ -x,\ -y,\ -z$$

must consequently transform contragrediently to $t$, $xyz$; they
are therefore the components of the vector associated with
$(t, xyz)$ in the space which is the dual of the 4-dimensional space-time
world. Such a dual vector is given by

$$H,\ -p_x,\ -p_y,\ -p_z;$$

or, what amounts to the same thing,

$$H\,dt - (p_x\,dx + p_y\,dy + p_z\,dz)$$

is invariant under Lorentz transformations. The same is true
of the total differential operator

$$dt\,\frac{\partial}{\partial t} + dx\,\frac{\partial}{\partial x} + dy\,\frac{\partial}{\partial y} + dz\,\frac{\partial}{\partial z}$$

applied to an arbitrary function of $t$; $x$, $y$, $z$. Hence the correspondence
(2.5) necessarily implies the further relations

$$p_x \to \frac{h}{i}\frac{\partial}{\partial x}, \qquad p_y \to \frac{h}{i}\frac{\partial}{\partial y}, \qquad p_z \to \frac{h}{i}\frac{\partial}{\partial z}, \tag{2.6}$$

which are to be given the analogous interpretation.

A homogeneous plane wave 

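$$\psi = a \cdot e^{i(-\nu t + \alpha x + \beta y + \gamma z)} \tag{2.7}$$

is simultaneously a characteristic function of the four mutually
commutative operators (2.5), (2.6), which has as characteristic
numbers

$$H = h\nu; \qquad p_x = h\alpha, \quad p_y = h\beta, \quad p_z = h\gamma. \tag{2.8}$$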


It represents a state in which the energy and linear momentum 
of the quantum possess these sharply defined values. 
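That the plane wave is a characteristic function of the energy and momentum operators can be verified numerically. The sketch below restricts itself to one spatial co-ordinate, sets $h = 1$, and replaces the derivatives by central differences; the numerical values of $\nu$ and $\alpha$ and all names are assumptions chosen for illustration.

```python
import cmath

H_BAR = 1.0  # the constant written h in the text, set to 1 for illustration

def plane_wave(t, x, nu=2.0, alpha=3.0, a=1.0):
    """One-dimensional plane wave a * exp(i(-nu t + alpha x))."""
    return a * cmath.exp(1j * (-nu * t + alpha * x))

def energy_op(psi, t, x, eps=1e-5):
    """Apply -(h/i) d/dt by a central difference."""
    dpsi_dt = (psi(t + eps, x) - psi(t - eps, x)) / (2 * eps)
    return -(H_BAR / 1j) * dpsi_dt

def momentum_op(psi, t, x, eps=1e-5):
    """Apply (h/i) d/dx by a central difference."""
    dpsi_dx = (psi(t, x + eps) - psi(t, x - eps)) / (2 * eps)
    return (H_BAR / 1j) * dpsi_dx
```

Dividing the result of either operator by the value of the wave at the same point returns the characteristic number, here $h\nu = 2$ and $h\alpha = 3$, independently of the point chosen.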

In classical mechanics the laws governing the motion of a 
particle are known as soon as we express its energy H in terms 
of the “canonical variables” $x, y, z$, $p_x, p_y, p_z$. In Newtonian
mechanics the Hamiltonian function for a free material particle
of mass $m$ is

$$H = \frac{1}{2m}(p_x^2 + p_y^2 + p_z^2); \tag{2.9}$$

on employing the transition scheme developed above we obtain
the corresponding wave equation

$$\frac{h}{i}\frac{\partial\psi}{\partial t} = \frac{h^2}{2m}\left(\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2} + \frac{\partial^2\psi}{\partial z^2}\right). \tag{2.10}$$

(2.7) is a solution of this equation provided the values (2.8) of
energy and linear momentum satisfy equation (2.9); in this
sense (2.9) and (2.10) are equivalent. But the equation (2.10)
is linear and has as its most general solution a linear superposition
of simple waves (2.7); such a superposition corresponds
to a state in which the energy and momentum of the particle
assume their various permissible values “with a certain definite
probability.”

The space vector $(\alpha, \beta, \gamma)$ in (2.7) gives the direction of
propagation of the plane wave, and the modulus of this vector
is the wave number $\mu$ (the number of waves contained in $2\pi$
units of length; $2\pi/\mu$ is the wave length $\lambda$). Hence by (2.8)
the absolute value $p$ of the momentum is equal to $h\mu = \frac{2\pi h}{\lambda}$.
$\nu/\mu$ is the phase velocity of the wave; in accordance with (2.9) or

$$\nu = \frac{h}{2m}\mu^2$$

it is $h\mu/2m = h\pi/\lambda m$ and depends on the wave length or frequency
(dispersion). Since $p = mv$, where $v$ is the velocity of the
particle, the “group velocity” $\frac{d\nu}{d\mu} = \frac{h\mu}{m}$ coincides with the
velocity of the particle. Experiments on diffraction and inter- 
ference phenomena in electron beams, such as those performed 
by Davisson and Germer, have made it possible to test directly 
these relations set up by de Broglie. 
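The relation between phase velocity, group velocity and particle velocity can be illustrated with a few lines of arithmetic. The sketch assumes the Newtonian dispersion law $\nu = h\mu^2/2m$ with $h = 1$; the group velocity is obtained by a central difference, and all names and numerical values are hypothetical.

```python
H_BAR = 1.0  # the constant written h in the text, set to 1 for illustration

def frequency(mu, m):
    """Newtonian dispersion law nu = h mu^2 / (2 m)."""
    return H_BAR * mu**2 / (2.0 * m)

def phase_velocity(mu, m):
    """Phase velocity nu / mu."""
    return frequency(mu, m) / mu

def group_velocity(mu, m, eps=1e-6):
    """Group velocity d nu / d mu by a central difference."""
    return (frequency(mu + eps, m) - frequency(mu - eps, m)) / (2.0 * eps)
```

For $m = 2$ and $\mu = 3$ the momentum is $h\mu = 3$ and the particle velocity $p/m = 1.5$; the group velocity reproduces this value, while the phase velocity is only half as large.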

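In relativistic mechanics we have in place of (2.9) an equation
which states that the square of the absolute value of the energy-momentum
4-vector is constant and equal to $m^2c^2$:

$$\frac{H^2}{c^2} - (p_x^2 + p_y^2 + p_z^2) = m^2c^2 \tag{2.11}$$

or

$$H = c\sqrt{m^2c^2 + (p_x^2 + p_y^2 + p_z^2)}.$$

For the transition to a wave equation it is of advantage to employ
the rational form (2.11) of this expression:

$$-\frac{1}{c^2}\frac{\partial^2\psi}{\partial t^2} + \left(\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2} + \frac{\partial^2\psi}{\partial z^2}\right) = \frac{m^2c^2}{h^2}\psi. \tag{2.12}$$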

Here again the group velocity is equal to the velocity $v$ of the
particle, but the phase velocity is found to be $c^2/v$; the former
is always less, the latter always more than the velocity of light.
In order to return from the relativistic to the “ordinary” or
Newtonian mechanics by passing to the limit $c \to \infty$, we must
first replace $\psi$ by $e^{-imc^2t/h}\,\psi$.
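The statement about the two velocities follows from the relativistic dispersion law $h\nu = c\sqrt{m^2c^2 + h^2\mu^2}$ obtained from (2.11). The sketch below checks it numerically in units with $c = h = 1$; the units, the sample values and the function names are assumptions made for illustration.

```python
import math

def frequency_rel(mu, m, c=1.0, h=1.0):
    """Relativistic dispersion law nu = (c / h) sqrt(m^2 c^2 + h^2 mu^2)."""
    return (c / h) * math.sqrt(m * m * c * c + h * h * mu * mu)

def phase_velocity_rel(mu, m, c=1.0, h=1.0):
    """Phase velocity nu / mu."""
    return frequency_rel(mu, m, c, h) / mu

def group_velocity_rel(mu, m, c=1.0, h=1.0, eps=1e-6):
    """Group velocity d nu / d mu by a central difference."""
    up = frequency_rel(mu + eps, m, c, h)
    down = frequency_rel(mu - eps, m, c, h)
    return (up - down) / (2.0 * eps)
```

With $m = 1$ and $\mu = 0.75$ the group velocity is 0.6 and the phase velocity 5/3, so that their product equals $c^2$ and the velocity of light lies between them.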

The differential equation governing light waves can be obtained
from (2.11) by dropping the term on the right-hand side.
Hence from the corpuscular standpoint light consists of photons
or particles of proper mass 0:

$$\frac{H^2}{c^2} - (p_x^2 + p_y^2 + p_z^2) = 0.$$

In accordance with the expression (2.1) for the probability
density, we are to consider as the vector in unitary system-space
describing the state of the system the function $\psi$ in so far as it
depends on the spatial co-ordinates $xyz$. The integral of (2.1)
with respect to the spatial co-ordinates gives the probability
that the particle will be found within the volume $V$ at time $t$.
Space and time must be separated from one another; the system
has at each time $t$ a definite state $\psi(xyz)$, which will in general
vary with $t$. The operators which represent physical quantities
must accordingly be ones which operate on an arbitrary function
of the spatial co-ordinates. This requirement is satisfied by
the operators (2.6) corresponding to the momentum co-ordinates,
but not by differentiation with respect to time, which we have
associated with the energy. We must instead consider the
situation as described as follows: from the expression for the
energy in terms of the canonical variables $p_x$, $p_y$, $p_z$ we obtain
the operator $H$ which represents the energy and which operates
on the function $\psi(xyz)$. The equation

$$\frac{h}{i}\frac{\partial\psi}{\partial t} + H\psi = 0$$

is then the dynamical law which determines the change in the
state $\psi$ in time.

The separation of space and time offers certain difficulties 
to the development of quantum theory from the relativistic 
standpoint ; consequently, for the present, we base our develop- 
ment on the Newtonian mechanics. 

Our procedure must eventually be modified in another 
important respect : we have here tacitly assumed, for the sake 
of mathematical simplicity but without physical justification, 
that the wave field of a material particle is described by a scalar
quantity $\psi$. The modification, which is required in order to
give an adequate description of the facts of spectroscopy, will 
be made in Chap. IV. 

§ 3. Schrödinger’s Wave Equation. The Harmonic Oscillator

When the particle is moving under the influence of forces 
the kinematic part (2.9) of the energy is augmented by the 
potential energy, which usually depends on the co-ordinates 
alone and not on the momenta. We must therefore know 
which Hermitian operator acting on $\psi$ corresponds to the 
co-ordinate $x$. I assert that it is multiplication by $x$ ; this operator 
is already referred to its principal axes, its characteristic values 
are all real numbers $x$ and finally $\psi(x)$, or more precisely $\psi(x)\sqrt{dx}$, 
is the component of the “ vector ” $\psi$ associated with the 
characteristic number $x$ (we have here ignored the other co-ordinates $y, z$). 
In accordance with the statistical interpretation of the 
relationship between physical quantities and operators, our assertion is : 

the probability that $x$ has a value between $x_1$ and $x_2$ is $\int_{x_1}^{x_2} \bar\psi\psi\,dx$ ; 

this is in agreement with the expression (2.1) for the probability 
density. If $V(xyz)$ is a function of position in the 3-dimensional 
space, e.g. the potential energy, then the physical quantity $V$ 
is represented by the operator 

$$\psi \to V(xyz)\cdot\psi,$$

for the probability that $V$ lies between $V_1$ and $V_2$ is given by the 
integral 

$$\iiint \bar\psi\psi\,dx\,dy\,dz$$

extended over that portion of space in which $V_1 \leq V(xyz) \leq V_2$. 

The operators corresponding to $x, y, z$ commute with each 
other, but the operator $Q$ corresponding to $x$ and the operator 
$P$ corresponding to $p_x$ do not. In fact 

$$\frac{d}{dx}\bigl[x\,\psi(x)\bigr] - x\,\frac{d\psi(x)}{dx} = \psi(x)$$

or 

$$PQ - QP = \frac{h}{i}\,\mathbf{1},$$

where the $\mathbf{1}$ on the right-hand side stands for the operator 
identity : $\psi(x) \to \psi(x)$. Because of this non-commutative 
relation between the operators $P$ and $Q$, $p_x$ cannot assume a definite 
value with certainty when $x$ does, and conversely. In fact, if $p_x$ 
is known to have the value $h\alpha$ with certainty, then the dependence 
of $\psi$ on $x$ is given by the factor $e^{i\alpha x}$ ; in consequence of this the 
position $x$ of the particle is entirely indeterminate, since the 
probability of localization is the same for all points $x$. 
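
The commutation relation can be checked mechanically on polynomials. The following Python lines are only an illustrative sketch: they assume units in which $h = 1$, represent a polynomial by its list of coefficients, and use the hypothetical helper names `apply_P`, `apply_Q` and `commutator`, none of which come from the text.

```python
# Polynomials are coefficient lists: [c0, c1, c2, ...] means c0 + c1*x + c2*x**2 + ...
H = 1.0  # assumption: units in which h (Planck's constant over 2*pi) equals 1

def apply_Q(coeffs):
    """Q: multiplication by x (every coefficient moves up one degree)."""
    return [0] + list(coeffs)

def apply_P(coeffs):
    """P = (h/i) d/dx applied to a polynomial."""
    factor = H / 1j
    return [factor * k * c for k, c in enumerate(coeffs)][1:] or [0]

def subtract(a, b):
    """a - b, after padding both lists to a common length."""
    n = max(len(a), len(b))
    a = list(a) + [0] * (n - len(a))
    b = list(b) + [0] * (n - len(b))
    return [u - v for u, v in zip(a, b)]

def commutator(coeffs):
    """(PQ - QP) applied to the polynomial."""
    return subtract(apply_P(apply_Q(coeffs)), apply_Q(apply_P(coeffs)))

psi = [2.0, -1.0, 0.5, 3.0]            # arbitrary test polynomial
lhs = commutator(psi)                  # (PQ - QP) psi
rhs = [(H / 1j) * c for c in psi]      # (h/i) psi
print(all(abs(u - v) < 1e-12 for u, v in zip(lhs, rhs)))
```

Any other test polynomial may be substituted for `psi`; the two sides agree coefficient by coefficient.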

If $V(x, y, z)$ is the potential energy of the field in which the 
particle moves, the total energy is 

$$H = \frac{1}{2m}\left(p_x^2 + p_y^2 + p_z^2\right) + V(x, y, z). \qquad (3.1)$$

We assume with Schrödinger that in spite of the fact that not all 
our variables commute we may still apply our rules for the 
formulation of the wave equation ; we thus obtain Schrödinger’s 
differential equation 

$$\frac{h}{i}\frac{\partial\psi}{\partial t} - \frac{h^2}{2m}\Delta\psi + V\psi = 0.$$

We understand by “ stationary ” or “ quantum states ” $\psi$ those 
in which the energy $E$ has a definite value ; they are 
characterized as solutions of the wave equation which satisfy in addition 
the equation [cf. (2.5)] 

$$\frac{h}{i}\frac{\partial\psi}{\partial t} + E\psi = 0.$$

On setting $E = h\nu$, such a $\psi$ will have the form $e^{-i\nu t}\cdot\psi$ where 
the new function denoted by $\psi$ is independent of $t$. This function 
$\psi(xyz)$, which depends only on the spatial co-ordinates, satisfies 
the reduced equation 

$$\frac{h^2}{2m}\Delta\psi + [E - V(xyz)]\psi = 0.$$

The problem is thus reduced to finding values of E and functions 
$\psi \neq 0$ of position which satisfy this equation and are such that 
the integral of $\bar\psi\psi$ over the entire space is finite. They are the 
characteristic numbers and characteristic vectors of the Hermitian 
operator $H$ associated with the energy (3.1) in the function space 
of all functions of position $\psi$. The characteristic numbers $E$ 
are the possible energy levels of the particles. 

Before going any further into the interpretation of the theory 
we have developed, it will be well to convince ourselves that it 
leads to energy levels which are in agreement with the facts. 
The simplest example is that of the linear oscillator ; with it 
we are dealing with only one co-ordinate $x$. The potential 
energy is $V(x) = \frac{a}{2}x^2$ and the total energy 

$$H = \frac{1}{2m}p^2 + \frac{a}{2}x^2. \qquad (3.2)$$

The equation for the determination of the characteristic values 
$E$ and the associated characteristic functions $\psi$ is 

$$\frac{h^2}{2m}\frac{d^2\psi}{dx^2} + \left(E - \frac{a}{2}x^2\right)\psi = 0. \qquad (3.3)$$

Hermitian polynomials. The solutions of this equation are 
expressed in terms of Hermitian polynomials. The 
Hermitian polynomial $\eta_n(x)$ is defined by the equation 

$$e^{-x^2/2}\,\eta_n(x) = (-1)^n\,\frac{d^n}{dx^n}\,e^{-x^2/2} ; \qquad (3.4)$$

it is of degree $n$ and the highest term is exactly $x^n$. The 
$\eta_n(x)$ $(n = 0, 1, 2, \cdots)$ constitute an orthogonal set of functions 
with the “ density function ” $e^{-x^2/2}$ : 

$$\int_{-\infty}^{+\infty} e^{-x^2/2}\,\eta_m(x)\,\eta_n(x)\,dx = 0, \qquad m \neq n ; \qquad (3.5)$$

the functions 

$$\phi_n(x) = e^{-x^2/4}\,\eta_n(x)$$

are consequently orthogonal in the ordinary sense. To prove 
this we need merely to note that 

$$(-1)^n\int_{-\infty}^{+\infty} \frac{d^n e^{-x^2/2}}{dx^n}\cdot\eta_m(x)\,dx$$

becomes, on integrating $n$ times by parts, 

$$\int_{-\infty}^{+\infty} e^{-x^2/2}\,\frac{d^n\eta_m}{dx^n}\,dx$$

and the integrand vanishes for $m < n$. For $m = n$ we obtain 

$$n!\int_{-\infty}^{+\infty} e^{-x^2/2}\,dx,$$

so the equations (3.5) can be supplemented by 

$$\int_{-\infty}^{+\infty} e^{-x^2/2}\,\eta_n^2(x)\,dx = n!\,\sqrt{2\pi}.$$
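
The orthogonality relations (3.5) and their supplement can be verified exactly for the first few polynomials. The sketch below is an added illustration under stated assumptions: it builds the polynomials from the definition (3.4) by repeated differentiation, and it uses the known Gaussian moments $\int x^k e^{-x^2/2}dx = \sqrt{2\pi}\,(k-1)!!$ for even $k$ (zero for odd $k$). The function names are hypothetical.

```python
from math import factorial

def hermite_from_definition(n):
    """Coefficients [c0, c1, ...] of eta_n, built from definition (3.4).

    Writing d^k/dx^k exp(-x^2/2) = q_k(x) * exp(-x^2/2) gives q_0 = 1 and
    q_{k+1} = q_k' - x*q_k, so that eta_n = (-1)^n * q_n.
    """
    q = [1]
    for _ in range(n):
        dq = [k * c for k, c in enumerate(q)][1:]   # q'
        xq = [0] + q                                 # x*q
        dq += [0] * (len(xq) - len(dq))
        q = [d - s for d, s in zip(dq, xq)]
    sign = -1 if n % 2 else 1
    return [sign * c for c in q]

def gaussian_moment(k):
    """Integral of x^k * exp(-x^2/2) over the real line, divided by sqrt(2*pi)."""
    if k % 2:
        return 0
    result = 1
    for j in range(1, k, 2):     # (k-1)!! = 1*3*5*...*(k-1)
        result *= j
    return result

def weighted_inner(m, n):
    """(1/sqrt(2*pi)) * integral of exp(-x^2/2) * eta_m * eta_n, in exact integers."""
    a, b = hermite_from_definition(m), hermite_from_definition(n)
    return sum(ca * cb * gaussian_moment(i + j)
               for i, ca in enumerate(a) for j, cb in enumerate(b))

table = [[weighted_inner(m, n) for n in range(5)] for m in range(5)]
print(table)   # off-diagonal entries vanish; the diagonal holds n!
```

Because only integer arithmetic is used, the check involves no rounding.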

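From (3.4) we have 

$$e^{-x^2/2}\cdot\eta_{n+1}(x) = -(-1)^n\,\frac{d^{n+1}}{dx^{n+1}}\,e^{-x^2/2}$$

and we can consider $\frac{d^{n+1}}{dx^{n+1}}$ as either $\frac{d^n}{dx^n}\left(\frac{d}{dx}\right)$ or $\frac{d}{dx}\left(\frac{d^n}{dx^n}\right)$. Since 

$$\frac{d}{dx}\left(e^{-x^2/2}\right) = -x\cdot e^{-x^2/2}$$

and 

$$\frac{d^n}{dx^n}\left(x\,e^{-x^2/2}\right) = x\,\frac{d^n}{dx^n}\left(e^{-x^2/2}\right) + n\,\frac{d^{n-1}}{dx^{n-1}}\left(e^{-x^2/2}\right),$$

the first of these interpretations yields the recursion formula 

$$\eta_{n+1}(x) = x\,\eta_n(x) - n\,\eta_{n-1}(x). \qquad (3.6)$$

From the second we find 

$$e^{-x^2/2}\cdot\eta_{n+1}(x) = -\frac{d}{dx}\left(e^{-x^2/2}\,\eta_n(x)\right)$$

or 

$$\eta_{n+1}(x) = -\frac{d\eta_n}{dx} + x\,\eta_n(x). \qquad (3.7)$$

On subtracting the recursion formula (3.7) from (3.6) we find 
the simple relation 

$$\frac{d\eta_n}{dx} = n\,\eta_{n-1}. \qquad (3.8)$$

Differentiating (3.7) and substituting $(n + 1)\eta_n$ for the derivative 
of $\eta_{n+1}$ in accordance with (3.8), we obtain the differential equation 

$$\frac{d^2\eta_n}{dx^2} - x\,\frac{d\eta_n}{dx} + n\,\eta_n = 0.$$

The equation for $\phi_n(x)$ is consequently 

$$\frac{d^2\phi_n}{dx^2} - \frac{x^2}{4}\,\phi_n + \left(n + \tfrac12\right)\phi_n = 0. \qquad (3.9)$$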
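
The chain of relations just derived can be confirmed for small $n$ by exact polynomial arithmetic. This is a hedged sketch added for illustration: it assumes the recursion (3.6) in the form $\eta_{n+1} = x\eta_n - n\eta_{n-1}$ with $\eta_0 = 1$, $\eta_1 = x$, and then tests the relation $\eta_{n+1} = -\eta_n' + x\eta_n$, the derivative rule $\eta_n' = n\eta_{n-1}$, and the differential equation $\eta_n'' - x\eta_n' + n\eta_n = 0$. All function names are hypothetical.

```python
def hermite_by_recursion(n_max):
    """eta_0 .. eta_n_max as coefficient lists, from eta_{n+1} = x*eta_n - n*eta_{n-1}."""
    etas = [[1], [0, 1]]
    for n in range(1, n_max):
        x_eta = [0] + etas[n]
        prev = etas[n - 1] + [0] * (len(x_eta) - len(etas[n - 1]))
        etas.append([a - n * b for a, b in zip(x_eta, prev)])
    return etas[: n_max + 1]

def derivative(p):
    return [k * c for k, c in enumerate(p)][1:] or [0]

def times_x(p):
    return [0] + p

def add(p, q, scale=1):
    """p + scale*q, padded to a common length."""
    n = max(len(p), len(q))
    p = p + [0] * (n - len(p))
    q = q + [0] * (n - len(q))
    return [a + scale * b for a, b in zip(p, q)]

def is_zero(p):
    return all(c == 0 for c in p)

etas = hermite_by_recursion(8)
checks = []
for n in range(1, 8):
    eta, d_eta = etas[n], derivative(etas[n])
    checks.append(is_zero(add(d_eta, etas[n - 1], -n)))                          # eta_n' = n*eta_{n-1}
    checks.append(is_zero(add(add(times_x(eta), d_eta, -1), etas[n + 1], -1)))   # eta_{n+1} = x*eta_n - eta_n'
    ode = add(add(derivative(d_eta), times_x(d_eta), -1), eta, n)                # eta'' - x*eta' + n*eta
    checks.append(is_zero(ode))
print(all(checks))
```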

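On going over to a new unit of length by the substitution 
$x = \mathrm{a}\,\xi$, the left-hand side of (3.3) is equal to the left-hand side 
of (3.9) multiplied by $h^2/2m\mathrm{a}^2$ provided 

$$\frac{h^2}{2m\mathrm{a}^2}\cdot\frac14 = \frac{a}{2}\,\mathrm{a}^2, \qquad \frac{h^2}{2m\mathrm{a}^2}\left(n + \tfrac12\right) = E.$$

Let $\omega = \sqrt{a/m}$ denote the classical frequency of the oscillator. 
The first of these conditions determines the new unit of length $\mathrm{a}$ : 

$$\mathrm{a}^2 = \frac{h}{2\sqrt{am}} = \frac{h}{2m\omega},$$

and the second requires that 

$$E = E_n = h\omega\left(n + \tfrac12\right). \qquad (3.10)$$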

It is possible to show that the $\phi_n(\xi)$ constitute a complete 
orthogonal system,® and consequently there can exist no further 
characteristic numbers and functions. The oscillator possesses 
the discrete energy levels (3.10) at intervals $h\omega$ apart. That the 
lowest energy level turns out to be $\tfrac12 h\omega$ instead of 0 is of itself of 
no significance, as we may always introduce an additive constant 
into the energy, although it is meaningful to assert that the least 
possible value of the quantity $H$, (3.2), is equal to $\tfrac12 h\omega$. 

However, the wave equation not only yields the energy levels 
as characteristic values, but it also gives us information 
concerning the probability of localization by means of the 
characteristic functions. For convenience we now take $\mathrm{a}$ as the 
unit of length. When the oscillator is in the state described by 
the energy level $E_n$, the probability that the oscillating particle is 
at a distance $x$ from its position of equilibrium is given by 
$\phi_n^2(x) = e^{-x^2/2}\,\eta_n^2(x)$. These probabilities are to be understood as 
relative, and refer to equal infinitesimal intervals about the 
points of comparison $x$. In particular, for the lowest energy 
level $n = 0$ the probability density is $e^{-x^2/2}$ ; we can therefore 
no longer say that the mass-point is at rest in the position of 
equilibrium, but rather the probability of its displacement from 
this position is given by a Gauss error curve. The normalized 
characteristic functions of (3.3) are given by 

$$\psi_n(x) = \frac{\phi_n(x)}{\sqrt{n!\,\sqrt{2\pi}}}.$$

On expressing any function $\psi(x)$ of position in terms of this set, 

$$\psi(x) \sim \sum_{n=0}^{\infty} x_n\,\psi_n(x), \qquad x_n = \int_{-\infty}^{+\infty} \psi(x)\,\psi_n(x)\,dx,$$

the operator belonging to the energy $H$ is, as we have already 
seen, expressed in terms of these co-ordinates $x_n$ by 

$$x_n \to h\omega\left(n + \tfrac12\right)\cdot x_n.$$

In order to find the operator associated with the co-ordinate $x$ 
we must express $x\,\psi_n(x)$ linearly in terms of the characteristic 
functions themselves ; by (3.6) we have 

$$x\,\phi_n = \phi_{n+1} + n\,\phi_{n-1},$$

whence 

$$x\,\psi_n = \sqrt{n+1}\,\psi_{n+1} + \sqrt{n}\,\psi_{n-1}.$$

The correspondence $\psi(x) \to x\,\psi(x)$ is thus expressed in terms of 
these Fourier coefficients by 

$$x_n \to \sqrt{n}\,x_{n-1} + \sqrt{n+1}\,x_{n+1} ;$$



its matrix $\|q_{nm}\|$ contains only the elements 

$$q_{n,n-1} = \sqrt{n}, \qquad q_{n,n+1} = \sqrt{n+1}. \qquad (3.11)$$

(On returning to the original unit of length the right-hand side 
must be multiplied by the factor $\mathrm{a}$.) On applying the operator 
$\frac{d}{dx}$ to $\phi_n$ we obtain, in accordance with (3.8) and (3.6), 

$$\frac{d\phi_n}{dx} = \tfrac12\left(n\,\phi_{n-1} - \phi_{n+1}\right),$$

whence 

$$\frac{d\psi_n}{dx} = \tfrac12\left(\sqrt{n}\,\psi_{n-1} - \sqrt{n+1}\,\psi_{n+1}\right).$$

The linear Hermitian correspondence associated with the 
momentum $P = \frac{h}{i}\frac{d}{dx}$ is accordingly 

$$x_n \to \frac{h}{2i}\left(-\sqrt{n}\,x_{n-1} + \sqrt{n+1}\,x_{n+1}\right) ;$$

its matrix $\|p_{nm}\|$ has as its only non-vanishing elements those 
for which $m = n \pm 1$ : 

$$p_{n,n-1} = -\frac{h}{2i}\sqrt{n}, \qquad p_{n,n+1} = \frac{h}{2i}\sqrt{n+1}. \qquad (3.12)$$

(On returning to the original unit of length these elements are 
to be multiplied by $1/\mathrm{a}$. Terms with the index $n - 1$ are to 
be omitted when $n = 0$ ; in fact, they automatically drop out 
of the above formulae.) 
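
As a consistency check on (3.11) and (3.12), one may build finite sections of the two matrices and confirm that they reproduce both the commutation rule $PQ - QP = \frac{h}{i}\mathbf{1}$ and the energy levels (3.10). The following is a minimal sketch under assumptions that are not in the text: the numerical values of $h$, $m$ and $\omega$ are set to 1, the matrices are cut off at size `N`, and only rows unaffected by the cut-off are examined.

```python
from math import sqrt

N = 12           # truncation size (assumed; the last row is spoiled by the cut-off)
H_BAR = 1.0      # the quantity written h in the text; value assumed
MASS = 1.0       # assumed
OMEGA = 1.0      # assumed
LENGTH = sqrt(H_BAR / (2 * MASS * OMEGA))    # unit of length: a^2 = h / (2 m omega)
SPRING = MASS * OMEGA ** 2                   # force constant, from omega = sqrt(a/m)

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(N)) for j in range(N)]
            for i in range(N)]

Q = [[0j] * N for _ in range(N)]
P = [[0j] * N for _ in range(N)]
for n in range(N):
    if n >= 1:
        Q[n][n - 1] = LENGTH * sqrt(n)                      # (3.11), original units
        P[n][n - 1] = -(H_BAR / 2j) * sqrt(n) / LENGTH      # (3.12), original units
    if n + 1 < N:
        Q[n][n + 1] = LENGTH * sqrt(n + 1)
        P[n][n + 1] = (H_BAR / 2j) * sqrt(n + 1) / LENGTH

PQ, QP = matmul(P, Q), matmul(Q, P)
PP, QQ = matmul(P, P), matmul(Q, Q)
ENERGY = [[PP[i][j] / (2 * MASS) + 0.5 * SPRING * QQ[i][j] for j in range(N)]
          for i in range(N)]

commutator_diagonal = [PQ[n][n] - QP[n][n] for n in range(N - 1)]
energy_levels = [ENERGY[n][n].real for n in range(N - 1)]
print(energy_levels[:4])
```

The elements of `ENERGY` two places off the diagonal cancel between the kinetic and potential parts, which is why the energy matrix comes out diagonal.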


§ 4. Spherical Harmonics 

In order to discuss the energy levels of an electron in a 
spherically symmetric electrostatic field we must first discuss 
spherical harmonics and their principal properties. 

1. Definition. — Let r denote the distance from the origin in 
the 3-dimensional space with co-ordinates $x, y, z$, and let $r, \theta, \phi$ 
be polar co-ordinates with polar axis along the positive $z$ 
direction : 

$$x + iy = r\sin\theta\,e^{i\phi}, \qquad z = r\cos\theta.$$

On setting a homogeneous polynomial $u$ of degree $l$ in $x, y, z$ 
equal to $r^l\cdot Y_l$, $Y_l$ depends only on the directional co-ordinates 
$\theta, \phi$ and is a function of position on the unit sphere. If $u$ is 
a harmonic function, i.e. if it satisfies the equation $\Delta u = 0$, 
$Y_l$ is said to be a surface harmonic of degree $l$ and the harmonic 
function $u$ itself is said to be a spherical (or solid) harmonic of 
degree $l$. Since in polar co-ordinates 

$$\Delta u = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial u}{\partial r}\right) + \frac{1}{r^2}\Lambda u, \qquad \Lambda u = \frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial u}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2 u}{\partial\phi^2}, \qquad (4.1)$$

the surface harmonic $Y_l$ satisfies the differential equation 

$$\Lambda Y_l + l(l + 1)\,Y_l = 0. \qquad (4.2)$$

2. Orthogonality . — On applying Green’s formula to the 
spherical harmonics $u = r^kY_k$, $v = r^lY_l$ on the interior of the 
unit sphere, we obtain the orthogonality relations 

$$\int Y_kY_l\,d\omega = 0, \qquad k \neq l, \qquad (4.3)$$

in which $d\omega = \sin\theta\,d\theta\,d\phi$ is the surface element on the unit sphere. 
Since the conjugate complex $\bar Y_k$ of a surface harmonic is also a 
surface harmonic, the first factor in (4.3) can be replaced by $\bar Y_k$. 

3. Basis . — On writing 

$$\xi = x + iy, \qquad \eta = x - iy$$

the differential equation $\Delta u = 0$ becomes 

$$4\,\frac{\partial^2 u}{\partial\xi\,\partial\eta} + \frac{\partial^2 u}{\partial z^2} = 0 ;$$

we see that a homogeneous harmonic polynomial $u$ of degree $l$ in $\xi, \eta, z$ 
breaks up into harmonic polynomials $u^{(m)}$ : 

$$u = \sum u^{(m)}, \qquad (m = -l, \cdots, l - 1, l)$$

where $u^{(m)}$ consists of all terms in which the exponents of $\xi$ and 
$\eta$ have the fixed difference $m$. The recursion formula for the 
coefficients of $u^{(m)}$, which is obtained from the differential 
equation $\Delta u = 0$, further shows that there exists one, and to 
within a multiplicative constant only one, such harmonic $u^{(m)}$. 
Accordingly, there exist exactly $2l + 1$ linearly independent 
surface harmonics of degree $l$ ; we may take them to be the 
$Y_l^{(m)}$ defined by 

$$u^{(m)} = r^l\cdot Y_l^{(m)}.$$

Writing 

$$u^{(m)} = (x - iy)^{-m}\cdot P = (x + iy)^m\cdot P^*$$

and replacing $(x + iy)(x - iy)$ by $r^2 - z^2$, 
$P$ and $P^*$ depend only on $r^2$ and $z$. Hence on taking $r = 1$ 
we have 

$$Y_l^{(m)} = e^{im\phi}\,(\sin\theta)^{-m}\cdot P_l^{(m)}(\cos\theta). \qquad (4.4)$$

For $m = -l$ we take $P = 1$ and for $m = l$, $P^* = 1$ ; 
$P(z) = (1 - z^2)^l$ for this latter case. Since $Y_l^{(m)}$ depends on 
$\phi$ only in the factor $e^{im\phi}$, 

$$\int \bar Y_l^{(m')}\,Y_l^{(m)}\,d\omega = 0, \qquad m' \neq m. \qquad (4.5)$$

This basis, in which the $z$-axis occupies a preferred position, 
is accordingly unitary-orthogonal. 
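
The count of $2l + 1$ independent harmonics of degree $l$ can be confirmed by elementary linear algebra. The sketch below is an added illustration, not the argument of the text: it writes the Laplace operator as a matrix acting on the monomials $x^ay^bz^c$ with $a + b + c = l$, and obtains the number of harmonic polynomials as the dimension of its null space, using exact rational elimination. All function names are hypothetical.

```python
from fractions import Fraction

def monomials(deg):
    """Exponent triples (a, b, c) with a + b + c = deg."""
    return [(a, b, deg - a - b) for a in range(deg + 1) for b in range(deg + 1 - a)]

def laplacian_matrix(deg):
    """Matrix of the Laplace operator from degree `deg` to degree `deg - 2`."""
    cols = monomials(deg)
    rows = monomials(deg - 2) if deg >= 2 else []
    index = {mono: i for i, mono in enumerate(rows)}
    M = [[Fraction(0)] * len(cols) for _ in rows]
    for j, expo in enumerate(cols):
        for axis in range(3):
            e = expo[axis]
            if e >= 2:                      # second derivative of x^e is e(e-1) x^(e-2)
                lowered = list(expo)
                lowered[axis] -= 2
                M[index[tuple(lowered)]][j] += e * (e - 1)
    return M, len(cols)

def rank(M):
    """Rank by Gaussian elimination over the rationals."""
    M = [row[:] for row in M]
    r = 0
    ncols = len(M[0]) if M else 0
    for c in range(ncols):
        pivot = next((i for i in range(r, len(M)) if M[i][c] != 0), None)
        if pivot is None:
            continue
        M[r], M[pivot] = M[pivot], M[r]
        for i in range(r + 1, len(M)):
            factor = M[i][c] / M[r][c]
            if factor:
                M[i] = [a - factor * b for a, b in zip(M[i], M[r])]
        r += 1
    return r

def harmonic_dimension(deg):
    """Number of linearly independent harmonic polynomials of degree `deg`."""
    M, ncols = laplacian_matrix(deg)
    return ncols - rank(M)

print([harmonic_dimension(l) for l in range(6)])
```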

4. Completeness. — That the totality of surface harmonics 
constitute a complete orthogonal system on the unit sphere can 
be proved by showing that any polynomial in $x, y, z$ on the 
sphere can be written as a sum of surface harmonics. Now 
the general homogeneous polynomial of degree $l$ contains 

$$(l + 1) + l + (l - 1) + \cdots + 1$$

arbitrary constants. But exactly this same number of linearly 
independent homogeneous polynomials are contained in the 
expression 

$$r^l(Y_l + Y_{l-2} + \cdots) \quad [= r^lY_l + r^2\cdot r^{l-2}Y_{l-2} + \cdots], \qquad (4.6)$$

for the polynomials of the form $r^lY_l$, $r^lY_{l-2}$, $\cdots$ are linearly 
independent in virtue of the orthogonality of surface harmonics. 
$r^lY_l$ contains exactly $2l + 1 = (l + 1) + l$ linearly independent 
functions, and consequently (4.6) contains exactly 

$$[(l + 1) + l] + [(l - 1) + (l - 2)] + \cdots,$$

as asserted above. 

5. Closed expressions for the surface harmonics. — On substituting 
(4.4) in (4.2) we obtain the differential equation 

$$(1 - z^2)\frac{d^2P}{dz^2} + 2(m - 1)\,z\,\frac{dP}{dz} + [l(l + 1) - m(m - 1)]\cdot P = 0$$

for the polynomial $P = P_l^{(m)}$ in $z = \cos\theta$. From this equation 
we find that $\frac{dP}{dz}$ satisfies the same differential equation on 
replacing $m$ by $m - 1$ ; we thus obtain the recursion formula 

$$P_l^{(m-1)}(z) = \frac{dP_l^{(m)}}{dz}, \qquad P_l^{(m)}(z) = \frac{d^{\,l-m}}{dz^{\,l-m}}(1 - z^2)^l,$$

and the expression 

$$Y_l^{(m)} = e^{im\phi}\,(\sin\theta)^{-m}\cdot\frac{d^{\,l-m}}{dz^{\,l-m}}(1 - z^2)^l \qquad (z = \cos\theta).$$

In particular, the “ zonal harmonic ” 

$$P_l(z) = P_l^{(0)}(z) = \frac{d^l}{dz^l}(1 - z^2)^l.$$
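
The closed expressions can be tested against the differential equation of this paragraph. The sketch below is an added illustration resting on two assumptions stated here rather than taken verbatim from the text: that the equation reads $(1 - z^2)P'' + 2(m - 1)zP' + [l(l + 1) - m(m - 1)]P = 0$, and that $P_l^{(m)}$ is the $(l - m)$-th derivative of $(1 - z^2)^l$ with no further normalizing factor. Polynomials are coefficient lists and all helper names are hypothetical.

```python
def poly_mul(p, q):
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def poly_add(*polys):
    n = max(len(p) for p in polys)
    return [sum(p[k] if k < len(p) else 0 for p in polys) for k in range(n)]

def poly_scale(c, p):
    return [c * a for a in p]

def derivative(p):
    return [k * a for k, a in enumerate(p)][1:] or [0]

def P_lm(l, m):
    """(l-m)-th derivative of (1 - z^2)^l, for -l <= m <= l."""
    p = [1]
    for _ in range(l):
        p = poly_mul(p, [1, 0, -1])
    for _ in range(l - m):
        p = derivative(p)
    return p

def ode_residual(l, m):
    """(1 - z^2) P'' + 2(m-1) z P' + [l(l+1) - m(m-1)] P for P = P_lm(l, m)."""
    P = P_lm(l, m)
    dP = derivative(P)
    d2P = derivative(dP)
    return poly_add(
        poly_mul([1, 0, -1], d2P),
        poly_scale(2 * (m - 1), poly_mul([0, 1], dP)),
        poly_scale(l * (l + 1) - m * (m - 1), P),
    )

residuals_vanish = all(
    all(c == 0 for c in ode_residual(l, m))
    for l in range(5) for m in range(-l, l + 1)
)
print(residuals_vanish)
```

With this normalization the zonal harmonic `P_lm(l, 0)` differs from the customary Legendre polynomial only by the constant factor $(-1)^l\,2^l\,l!$.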


6. Further formulae. — 

$$\int x\,Y_kY_l\,d\omega = 0 \qquad (4.7)$$

unless $l = k \pm 1$. For $x\cdot r^kY_k$ is a polynomial of degree 
$k + 1$ and may, in accordance with 4, be expanded in the form 
$r^{k+1}(Y_{k+1} + Y_{k-1} + \cdots)$. Consequently on the unit sphere 

$$xY_k = Y_{k+1} + Y_{k-1} + \cdots \qquad (4.8)$$

and the only value of $l \geq k$ for which the integral (4.7) can 
have a value other than 0 is $l = k + 1$. Hence our assertion 
(4.7) ; it also follows from the above that only the first two 
terms can appear in (4.8). 

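Further, we shall also have occasion to use the differential 
expressions 

$$L_x = \frac{1}{i}\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right), \ \cdots, \qquad (4.9)$$

$$L^2u = L_x(L_xu) + L_y(L_yu) + L_z(L_zu)$$

in terms of polar co-ordinates. On setting in 

$$du = \frac{\partial u}{\partial x}dx + \frac{\partial u}{\partial y}dy + \frac{\partial u}{\partial z}dz$$

the changes $dx, dy, dz$ obtained by allowing $\phi$ to increase by 
$d\phi$ and holding $r, \theta$ fixed, we obtain immediately 

$$L_zu = \frac{1}{i}\frac{\partial u}{\partial\phi}. \qquad (4.10)$$

Similarly, 

$$L_x \pm iL_y = e^{\pm i\phi}\left(\pm\frac{\partial}{\partial\theta} + i\,\frac{\cos\theta}{\sin\theta}\frac{\partial}{\partial\phi}\right), \qquad L^2 = -\Lambda \ \ \text{[eq. (4.1)]}. \qquad (4.10)$$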
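
The action of these operators on a solid harmonic can be checked directly in Cartesian co-ordinates. The following sketch is an added illustration under stated assumptions: it takes $L_x = \frac{1}{i}\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right)$ and its cyclic permutations, represents polynomials in $x, y, z$ as dictionaries from exponent triples to coefficients, and applies the operators to the test function $u = (x + iy)^2 z$, a harmonic polynomial of degree $l = 3$ built from the factor $(x + iy)^m$ with $m = 2$. The helper names are hypothetical.

```python
def poly_mul(p, q):
    out = {}
    for (a, b, c), u in p.items():
        for (d, e, f), v in q.items():
            key = (a + d, b + e, c + f)
            out[key] = out.get(key, 0) + u * v
    return out

def poly_pow(p, n):
    out = {(0, 0, 0): 1}
    for _ in range(n):
        out = poly_mul(out, p)
    return out

def partial(p, axis):
    """Partial derivative with respect to x (axis 0), y (axis 1) or z (axis 2)."""
    out = {}
    for expo, coeff in p.items():
        if expo[axis] > 0:
            lowered = list(expo)
            lowered[axis] -= 1
            key = tuple(lowered)
            out[key] = out.get(key, 0) + coeff * expo[axis]
    return out

def lin(p, q, cp=1, cq=1):
    """The combination cp*p + cq*q."""
    out = {}
    for src, c in ((p, cp), (q, cq)):
        for key, v in src.items():
            out[key] = out.get(key, 0) + c * v
    return out

X, Y, Z = {(1, 0, 0): 1}, {(0, 1, 0): 1}, {(0, 0, 1): 1}
COORDS = (X, Y, Z)

def L(axis, u):
    """Component of (1/i) r x grad along `axis` (0, 1, 2 for x, y, z)."""
    j, k = (axis + 1) % 3, (axis + 2) % 3
    return lin(poly_mul(COORDS[j], partial(u, k)),
               poly_mul(COORDS[k], partial(u, j)), 1 / 1j, -1 / 1j)

def L_squared(u):
    total = {}
    for axis in range(3):
        total = lin(total, L(axis, L(axis, u)))
    return total

def close(p, q, tol=1e-9):
    return all(abs(p.get(k, 0) - q.get(k, 0)) < tol for k in set(p) | set(q))

u = poly_mul(poly_pow(lin(X, Y, 1, 1j), 2), Z)        # (x + iy)^2 * z
print(close(L(2, u), lin(u, {}, 2, 0)))               # L_z u = m u with m = 2
print(close(L_squared(u), lin(u, {}, 12, 0)))         # L^2 u = l(l+1) u with l = 3
```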


§ 5. Electron in Spherically Symmetric Field. 
Directional Quantization 

Now back to physics ! Consider an electron of charge — e 
revolving about a fixed nucleus of charge Ze situated at the 
origin. For $Z = 1$ we have the hydrogen atom, for $Z = 2$ 
singly ionized helium He⁺, for $Z = 3$ doubly ionized lithium 
Li⁺⁺, etc. The potential energy is $V = -\frac{Ze^2}{r}$ ; we shall, 
however, for the present take $V(r)$ more generally as any function 
of the radius $r$. The wave equation for the determination of 
the energy levels is then 

$$\frac{h^2}{2m}\Delta\psi + [E - V(r)]\psi = 0. \qquad (5.1)$$


On expanding $\psi$ in terms of surface harmonics $\psi$ becomes a sum 
of terms $f_l(r)Y_l$ $(l = 0, 1, 2, \cdots)$. The differential operator 
on the left-hand side of (5.1) sends the $l$th term of this sum into 
$Y_l$ times 

$$\frac{h^2}{2m}\left\{\frac{1}{r^2}\frac{d}{dr}\left(r^2\frac{df_l}{dr}\right) - \frac{l(l + 1)}{r^2}f_l\right\} + [E - V(r)]f_l(r). \qquad (5.2)$$

Consequently each individual term must satisfy the differential 
equation separately ; we thus obtain a complete set of 
characteristic functions of the form 

$$\psi = f_l(r)\,Y_l.$$

The factor $f_l(r)$ depending only on $r$ must be such that (5.2) 
vanishes and $\int_0^\infty r^2 f_l(r)\bar f_l(r)\,dr$ converges. Denoting the 
characteristic numbers and characteristic functions of this 
differential equation by 

$$E_{nl}, \quad f_{nl}(r) \qquad (n = 0, 1, 2, \cdots),$$

$E_{nl}$ is a $(2l + 1)$-fold energy level, as the expression $f_{nl}(r)Y_l$ 
contains $2l + 1$ linearly independent characteristic functions 
associated with this single characteristic value ; we may choose 
as a basis the functions 

$$\psi_{nl}^{(m)} = f_{nl}(r)\cdot Y_l^{(m)} \qquad (m = -l, \cdots, l - 1, l).$$


We thus arrive at three integral quantum numbers ; the 
“ radial quantum number ” $n$, the “ azimuthal quantum number ” $l$, 
and the “ magnetic quantum number ” $m$. The energy level 
depends only on the first two. 

In justification of this nomenclature we determine the angular 
momentum $h\mathfrak{L}$ of the electron with components 
$hL_x = yp_z - zp_y, \cdots$. 

In quantum mechanics $L_x, L_y, L_z$ are the operators (4.9). 
Hence for 

$$\psi = \psi_{nl}^{(m)} = e^{im\phi}\cdot(\text{a function of } r \text{ and } \theta) \qquad (5.3)$$

we have, in accordance with (4.10), 

$$L_z\psi = m\cdot\psi,$$

and for the general characteristic function 

$$\psi = f_{nl}(r)\,Y_l \qquad (5.4)$$

with azimuthal quantum number $l$ 

$$L^2\psi = l(l + 1)\cdot\psi.$$

Hence in the state described by (5.4) not only the energy has 
a definite value $E_{nl}$, but also the absolute value of the moment 
of momentum 

$$\mathfrak{L}^2 = l(l + 1). \qquad (5.5)$$

The significance of the azimuthal number is that it fixes this 
magnitude. It is indeed remarkable that there exist states 
$l = 0$, $n = 0, 1, 2, \cdots$ with spherically symmetric 
characteristic functions $\psi = f_{n0}(r)$ for which the moment of momentum 
vanishes. In the states described by (5.3) not only the energy 
and the absolute value of the moment of momentum have 
definite values, but also the $z$-component of the moment of 
momentum assumes a definite value with certainty, for then 

$$L_z = m. \qquad (5.6)$$

Since a magnetic dipole moment 

$$\mathfrak{s} = -\frac{eh}{2\mu c}\,\mathfrak{L} \qquad (5.7)$$

is associated with the angular momentum $h\mathfrak{L}$ of the revolving 
electron (the mass of the electron being denoted by $\mu$, whenever 
there is danger of confusion with the magnetic quantum number 
$m$), the influence of $\mathfrak{L}$ will be felt on subjecting the atom to a 
magnetic field. The existence of the Zeeman effect under such 
conditions can be traced to this cause. A fundamental 
experiment to observe the magnetic moment of the electron directly 
is due to Stern and Gerlach. Let a stream of one-electron atoms, 
which are all moving in the direction of the $x$-axis and are in 
the state $(n, l)$ with energy level $E_{nl}$, be subjected to an 
inhomogeneous magnetic field in the direction of the $z$-axis. Let 
the $x$- and $y$-components of the magnetic field vanish in the 
$(x, z)$-plane, in which the beam moves, and let the $z$-component 
be a function of $z$ alone. A magnetic dipole, the $z$-component 
of whose moment is $s_z$, is then acted upon by a force, 
proportional to $s_z$ and to the gradient of the field, 
in the positive $z$-direction. In consequence of (5.6) the atomic 
beam should be broken up into $2l + 1$ smaller beams by the 
force in the $z$-direction, corresponding to the various values 
$m = l, l - 1, \cdots, -l$ of the magnetic quantum number. 

On performing the experiment on silver atoms in the normal 
state two beams, corresponding to $m = \pm 1$, were observed ; 
the value of the “ Bohr magneton,” the elementary magnetic 
moment corresponding to one unit of angular momentum, was 
found to agree with the value $\frac{eh}{2\mu c}$ obtained from (5.6) and (5.7). 

Why the unperturbed beam corresponding to m = 0 did not 
appear remained unexplained. 

The older quantum theory, which employed the quantum 
number $k = l + 1$ with values $1, 2, \cdots$, allowed $m$ to assume 
the integral values from $-k$ to $+k$ ; it seemed plausible to 
exclude the case $k = 0$, although one was thereby led into 
difficulties on applying the so-called “ adiabatic hypothesis ” 
to the behaviour of an atom under the influence of crossed 
electric and magnetic fields. In the new quantum theory no 
ad hoc hypothesis is required for this exclusion, as $l$ can assume 
only the values $0, 1, 2, \cdots$. But according to either the old 
or the present scalar wave theory there should exist an odd 
number of permissible values of $m$ for given $k$ or $l$ ; the exclusion 
of the case $m = 0$ apparently required by the Stern-Gerlach 
experiment cannot be accounted for on either theory. Nor 
can we explain the related fact that in the anomalous Zeeman 
effect $m$ may assume either an even or an odd number of values, 
according to the nature of the atom under consideration. 
Obviously something is lacking in our present scalar wave 
theory as well as in the older formulation ; we return to this 
point again in Chap. IV, § 4. The older quantum theory 
described the situation met above as “ directional 
quantization ” ; since the absolute value of the moment of momentum 
was $hk$ and the component along the $z$-axis was $hm$, it concluded 
that the magnetic axis of the atom could assume only positions 
described by the inclination $\theta$ with the $z$-axis determined by 
the formula 

$$\cos\theta = \frac{m}{k} \qquad (m = 0, \pm 1, \pm 2, \cdots, \pm k).$$

Thus in the case $k = 1$ we should expect only three possible 
orientations for the magnetic axis : parallel and anti-parallel 
to the field, which we have taken in the direction of the $z$-axis, 
and perpendicular thereto — unless we empirically exclude this 
latter possibility m = 0 because of the Stern-Gerlach experiment, 
in which case we have but two. In either case we find ourselves 
faced with a serious dilemma, for the direction of the $z$-axis is 
an arbitrary direction in space. In order to avoid this one 
then assumed that the quantization was due to the influence of 
the magnetic field, and consequently the preferred $z$-direction 
was interpreted physically as the direction of the magnetic field. 
But even so the difficulty is not avoided in the limiting case of 
vanishing magnetic field, for the directional quantization should 
be maintained in arbitrarily weak fields. Or stated more 
physically, the radiation mechanism required by the 
Stern-Gerlach effect for the orientation of the atoms, which were 
originally in random orientation and precessing about the 
$z$-axis, requires about 10® times as long as the greatest time 
consistent with the observations. The stand taken by the new 
quantum theory on this point is fundamentally different. The 
possible states $(n, l)$ of the atom are described by the functions 
$\psi$ of the $(2l + 1)$-dimensional linear family 

$$\psi = \sum_{m=-l}^{+l} x_m\,\psi_{nl}^{(m)}$$

or by the vectors of a $(2l + 1)$-dimensional space with 
components $x_m$. The $z$-component of the moment of momentum, as 
well as the component in any arbitrary direction, is capable of 
assuming only the discrete values $hm$ $(m = l, l - 1, \cdots, -l)$. 

But in a state in which the $z$-component, for example, assumes 
the value $hm$ with certainty there is only a certain probability 
that any other component will assume a definite one of its 
possible values $h\cdot 0$, $h\cdot(\pm 1)$, $\cdots$, $h\cdot(\pm l)$. The name 
“ directional quantization ” is hardly an appropriate description 
of this situation.^ 

When the electro-static central force satisfies the Coulomb law 
and originates in a nucleus of charge $+Ze$, the differential 
equation (5.2) for the “ radial characteristic function ” $f = f_{nl}(r)$ 
becomes 

$$\frac{d^2(rf)}{dr^2} + \left(\frac{2mE}{h^2} + \frac{2mZe^2}{h^2r} - \frac{l(l + 1)}{r^2}\right)(rf) = 0.$$

The character of this equation is unchanged on going over to 
the new dependent variable $v$ defined by $rf = e^{-\alpha r}\cdot v$ : 

$$\frac{d^2v}{dr^2} - 2\alpha\,\frac{dv}{dr} + \left[\left(\frac{2mE}{h^2} + \alpha^2\right) + \frac{2mZe^2}{h^2r} - \frac{l(l + 1)}{r^2}\right]v = 0.$$

We choose $\alpha$ in such a way that the constant term in the 
coefficient of $v$ vanishes : 

$$h^2\alpha^2 = -2mE. \qquad (5.8)$$

We know from the general theory of linear differential equations 
that there exist solutions of this equation in the neighbourhood 
of the (regular) singular point $r = 0$ in the form of a power 
series 

$$v = \sum_\mu a_\mu r^\mu$$

in which the exponent $\mu$ begins with a certain value $\mu_0$, which 
need not be an integer, and runs through the values $\mu_0$, $\mu_0 + 1$, 
$\mu_0 + 2$, $\cdots$. On substituting this power series into the 
equation we find the recursion formula 

$$\bigl(\mu(\mu + 1) - l(l + 1)\bigr)\,a_{\mu+1} = 2\left(\alpha\mu - \frac{Zme^2}{h^2}\right)a_\mu \qquad (5.9)$$

for the coefficients $a_\mu$. In order that it be satisfied for $\mu + 1 = \mu_0$ 
$(a_\mu = 0,\ a_{\mu+1} \neq 0)$ we must have 

$$\mu_0(\mu_0 - 1) = l(l + 1).$$

We thus have the two possibilities : 

$$\mu_0 = l + 1 \quad\text{or}\quad \mu_0 = -l.$$

Considering the first possibility and taking the coefficient $a_{l+1}$ 
of the lowest power as unity, all remaining coefficients can be 
obtained by successive applications of the recursion formula 
(5.9), as the denominator $\mu(\mu + 1) - l(l + 1)$ never vanishes ; 
let the solution thus obtained be denoted by $v$. The second 
possibility does not lead to a solution, however, as the 
denominator in the recursion formula for $\mu = l$ vanishes ; the second 
solution of the differential equation can be obtained by 
quadrature from the first and involves logarithmic terms. 

The power series for $v$ breaks off if for a definite exponent 
$\mu = \mu_0 + n$ 

$$\alpha\mu = \frac{Zme^2}{h^2} \quad\text{or}\quad h\alpha = \frac{Zme^2}{h(n + l + 1)}. \qquad (5.10)$$

In this case $f$ is of the form 

$$e^{-\alpha r}\cdot r^l\cdot(\text{polynomial of degree } n \text{ in } r) ;$$

it is finite at $r = 0$ and the integral 

$$\int_0^\infty r^2 f(r)\bar f(r)\,dr \qquad (5.11)$$

exists, as is to be required. The corresponding characteristic 
numbers $E$ are the energy levels ; on writing $n$ in place of 
$n + l + 1$ and solving (5.8), (5.9) for $E$ we find 

$$E = -\frac{Z^2me^4}{2h^2}\cdot\frac{1}{n^2}. \qquad (5.12)$$

The integer $n$, the principal or total quantum number, is 
subject to the condition $n > l$. There exist no other solutions 
for which the integral (5.11) converges.® 
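
The breaking off of the power series can be followed step by step. The sketch below is an added illustration under assumptions stated here: units are chosen so that $Zme^2/h^2 = 1$, the recursion formula is taken in the form $\bigl(\mu(\mu + 1) - l(l + 1)\bigr)a_{\mu+1} = 2(\alpha\mu - 1)a_\mu$, and the decay constant is $\alpha = 1/(n_r + l + 1)$, where `n_r` (a hypothetical name) stands for the radial quantum number. Exact fractions are used throughout.

```python
from fractions import Fraction

def radial_series(n_r, l):
    """Coefficients a_mu of v(r) = sum a_mu r^mu, starting at mu = l + 1.

    Assumes units with Z*m*e^2/h^2 = 1, so that alpha = 1/(n_r + l + 1).
    Returns (alpha, {mu: a_mu}); the loop ends when the series breaks off.
    """
    alpha = Fraction(1, n_r + l + 1)
    coeffs = {l + 1: Fraction(1)}          # lowest coefficient taken as unity
    mu = l + 1
    while True:
        nxt = 2 * (alpha * mu - 1) * coeffs[mu] / (mu * (mu + 1) - l * (l + 1))
        if nxt == 0:
            return alpha, coeffs
        mu += 1
        coeffs[mu] = nxt

def energy(n_r, l):
    """E = -alpha^2 / 2, i.e. E in units of Z^2*m*e^4/h^2, from h^2 alpha^2 = -2mE."""
    alpha, _ = radial_series(n_r, l)
    return -alpha ** 2 / 2

alpha, coeffs = radial_series(1, 0)        # one radial node, l = 0
print(alpha, coeffs)
print(energy(1, 0), energy(0, 1))          # both belong to principal number 2
```

The two energies printed at the end coincide, which is the degeneracy in $l$ for fixed principal quantum number.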

The energy levels depend only on the principal quantum 
number $n$ ; the terms for which $n$ is a fixed number and 
$l = 0, 1, \cdots, n - 1$ coincide in a single degenerate term 
of multiplicity 

$$\sum_{l=0}^{n-1}(2l + 1) = n^2.$$

This theoretical result agrees with the empirical formulae for the 
Balmer, Paschen, Lyman, etc., series. We find, in fact, the 
expression 

$$-\frac{Z^2R}{n^2}, \qquad R = \frac{me^4}{4\pi ch^3}$$

for the terms measured in wave-numbers $\left(\frac{\nu}{2\pi c} = \frac{E}{2\pi ch}\right)$. The 
expression for the Rydberg constant $R$ in terms of the fundamental 
constants of nature (the charge and the mass of the electron, the 
velocity of light and the elementary quantum of action) agrees 
numerically with its empirical value. All terms and therefore 
all actual line frequencies $\nu$ depend on the integer $Z$ describing 
the charge on the nucleus in such a way that $\sqrt{\nu}$ increases in 
proportion with $Z$. Since the X-ray terms are due to the 
innermost electrons, which are but slightly affected by the outer 
ones, we should expect to find that the hardest X-ray lines, 
arranged in accordance with the atomic number $Z$, follow this 
law. It was discovered by Moseley and gave a conclusive proof 
of the fact that on going through the elements of the periodic table 
the charge on the nucleus increases by $e$ from element to element. 
This law uncovers with unerring certainty the holes yet 
remaining in the system of known elements ; at present we lack 
but 2 (or 3) elements in the series beginning with hydrogen, 
$Z = 1$, and ending with uranium, $Z = 92$. 
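
The numerical agreement claimed for the Rydberg constant is easy to reproduce. The sketch below is an added illustration: the values of the constants are modern ones in Gaussian units, supplied here as assumptions and not taken from the text, and $R$ is computed as $me^4/(4\pi ch^3)$ with $h$ denoting Planck's constant divided by $2\pi$. The function names are hypothetical.

```python
from math import pi

# Assumed modern values in Gaussian (CGS) units; they do not come from the text.
ELECTRON_MASS = 9.1093837e-28       # g
ELECTRON_CHARGE = 4.80320471e-10    # esu
LIGHT_SPEED = 2.99792458e10         # cm/s
H_BAR = 1.054571817e-27             # erg*s (the quantity the text writes as h)

RYDBERG = ELECTRON_MASS * ELECTRON_CHARGE ** 4 / (4 * pi * LIGHT_SPEED * H_BAR ** 3)  # cm^-1

def term(n, Z=1):
    """Spectral term -Z^2 R / n^2 in wave-numbers (cm^-1)."""
    return -Z ** 2 * RYDBERG / n ** 2

def line_wavelength_nm(n_low, n_high, Z=1):
    """Wavelength in nanometres of the line between levels n_high and n_low."""
    wave_number = term(n_high, Z) - term(n_low, Z)     # cm^-1
    return 1e7 / wave_number

def multiplicity(n):
    """Number of states sharing the level n: sum of (2l + 1) for l = 0 .. n-1."""
    return sum(2 * l + 1 for l in range(n))

print(RYDBERG)                      # close to the empirical 1.097e5 cm^-1
print(line_wavelength_nm(2, 3))     # first Balmer line, in the red
print([multiplicity(n) for n in range(1, 5)])
```

The nucleus is treated as infinitely heavy here, so the computed wavelengths differ slightly from measured hydrogen lines.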

The characteristic functions associated with these energy 
levels, which determine the relative probabilities of the various 
positions of the electron, can be expressed in closed form in 
terms of the so-called Laguerre polynomials. The 
characteristic function belonging to the normal state $n = 1$, $l = 0$, is 
spherically symmetric ; * 

$$\psi = \frac{1}{\sqrt{\pi a^3}}\,e^{-r/a} ; \qquad (5.13)$$

for hydrogen 

$$a = \frac{h^2}{me^2} = 0.532\ \text{Å}. \qquad (5.14)$$

(According to the older Bohr theory, $a$ is the radius of the 
innermost electronic orbit.) $a$ determines the order of magnitude 
of atomic dimensions. In the normal state hydrogen possesses 
spherical symmetry (according to the scalar wave theory ; but 
see Chap. IV, § 8). 

The radial characteristic functions $r\cdot f_{nl}(r)$ do not, however, 
constitute a complete orthogonal system for a given $l$ for the 
full domain which we wish to consider : in addition to the 
discrete term spectrum (5.12) we have the continuous spectrum 
covering the whole region $E \geq 0$. We go no further into this 
matter.^ 


§ 6. Collision Phenomena 

The optical phenomena show that the quantum theory leads 
to the correct energy levels, but they do not lend themselves 
to an attempt to interpret the vector $\psi$ in system space as a 
probability. Collision phenomena, which deal with the 
deflection of electrons or $\alpha$-particles under the influence of other 
material bodies, are best suited for this latter purpose. The 
fundamental experiments of Franck and Hertz, as well as those 
of Davisson and Germer, belong to this latter category. 

Neglecting the reaction of the moving particle on the per- 
turbing body, the potential energy due to this latter may be 
taken as a given function $V(xyz)$ of position. Considering 
a one-dimensional problem, the energy of the moving particle is 
then 

$$H = \frac{1}{2m}p^2 + V(x).$$

We can think of the curve $y = V(x)$ as the contour of a hill 
against which the particle runs. The wave equation for a 
state with given energy $E$ is 

$$\frac{h^2}{2m}\frac{d^2\psi}{dx^2} + \bigl(E - V(x)\bigr)\psi = 0. \qquad (6.1)$$

* The normalizing factor $1/\sqrt{\pi a^3}$ is calculated from 
$4\pi\int_0^\infty r^2e^{-2r/a}\,dr = 4\pi\cdot\frac{a^3}{4} = \pi a^3$. 

If we neglect for the moment the perturbing field $V$ we obtain 
as solutions of (6.1) the familiar de Broglie waves : $\psi$ is a linear 
combination of the waves $e^{i\alpha x}$ and $e^{-i\alpha x}$ proceeding in the positive 
and negative directions along the $x$-axis, the wave number $\alpha$ 
of which is determined by 

$$(h\alpha)^2 = 2mE \quad\text{or}\quad h\alpha = p.$$

Writing 

$$\frac{2m}{h^2}V(x) = U(x)$$

equation (6.1) becomes 

$$\frac{d^2\psi}{dx^2} + \bigl(\alpha^2 - U(x)\bigr)\psi = 0. \qquad (6.2)$$

We now assume that as $x \to \pm\infty$, $U(x)$ behaves in such a way 
that the integral $\int_{-\infty}^{+\infty}|U(x)|\,dx$ converges ; equation (6.2) then has 
one solution which behaves for $x \to +\infty$ asymptotically like 
$e^{i\alpha x}$ and another, which is linearly independent of the first, 
which behaves like $e^{-i\alpha x}$ in the same region. 

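This can most readily be seen by solving (6.2) by the method 
of successive approximations. Let 

$$\psi = \psi_0 + \psi_1 + \psi_2 + \cdots \qquad (6.3)$$

and take as the 0th approximation the function $\psi_0 = e^{i\alpha x}$ ; in general 
$\psi_{n+1}$ is determined in terms of $\psi_n$ by integrating the equation 

$$\frac{d^2\psi_{n+1}}{dx^2} + \alpha^2\psi_{n+1} = U(x)\,\psi_n.$$

Hence 

$$\psi_{n+1}(x) = -\frac{1}{\alpha}\int_x^\infty \sin\alpha(x - \xi)\cdot U(\xi)\,\psi_n(\xi)\,d\xi. \qquad (6.4)$$

We restrict ourselves for the moment to a region $x \geq x_0$ such 
that 

$$\frac{1}{\alpha}\int_{x_0}^\infty |U(x)|\,dx \leq g < 1.$$

If $|\psi_n(x)| \leq a_n$ for all $x$, the integral (6.4) converges and we have 

$$|\psi_{n+1}(x)| \leq \frac{a_n}{\alpha}\int_x^\infty |U(\xi)|\,d\xi \leq g\,a_n ;$$

we can therefore take $a_0 = 1$, $a_{n+1} = g\,a_n$. Then $a_n = g^n$ or 

$$|\psi_n(x)| \leq g^n \quad\text{for } x \geq x_0.$$

Consequently the series for $\psi$ converges at least as fast as the 
geometric series with ratio $g$. It satisfies the integral equation 

$$\psi(x) - \psi_0(x) = -\frac{1}{\alpha}\int_x^\infty \sin\alpha(x - \xi)\cdot U(\xi)\,\psi(\xi)\,d\xi \qquad (6.5)$$

and is consequently a solution of (6.2). Since 

$$|\psi(x)| \leq 1 + g + g^2 + \cdots = \frac{1}{1 - g},$$

(6.5) leads to the estimate 

$$|\psi(x) - \psi_0(x)| \leq \frac{1}{\alpha(1 - g)}\cdot\int_x^\infty |U(\xi)|\,d\xi,$$

from which it follows that $\psi(x)$ behaves asymptotically for 
$x \to +\infty$ like $\psi_0 = e^{i\alpha x}$. Not only is $\psi \sim \psi_0$ but also 
$\frac{d\psi}{dx} \sim \frac{d\psi_0}{dx}$, for the equation 

$$\frac{d\psi}{dx} - \frac{d\psi_0}{dx} = -\int_x^\infty \cos\alpha(x - \xi)\cdot U(\xi)\,\psi(\xi)\,d\xi$$

gives as an upper bound for the absolute value of the difference 
on the left-hand side the quantity 

$$\frac{1}{1 - g}\cdot\int_x^\infty |U(\xi)|\,d\xi$$

which approaches 0 as $x \to +\infty$. 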

The solution $\psi(x)$ which we have found in the region $x \geq x_0$ 
can naturally be extended over the entire real axis by analytic 
continuation. Since our considerations apply just as well for 
$x \to -\infty$, we know that $\psi(x)$ satisfies an asymptotic equation 
of the form 

$$\psi(x) \sim b\,e^{i\alpha x} + b'e^{-i\alpha x} \quad\text{for } x \to -\infty.$$

At the same time we must also have 

$$\frac{d\psi}{dx} \sim i\alpha\left(b\,e^{i\alpha x} - b'e^{-i\alpha x}\right).$$

$\psi(x)$ being a solution of the differential equation, $\bar\psi(x)$ is 
also : 

$$\frac{d^2\psi}{dx^2} + \bigl(\alpha^2 - U(x)\bigr)\psi = 0, \qquad \frac{d^2\bar\psi}{dx^2} + \bigl(\alpha^2 - U(x)\bigr)\bar\psi = 0.$$

Multiply the first equation by $\bar\psi$, the second by $\psi$ and subtract ; 
we find 

$$\frac{d}{dx}\begin{vmatrix}\bar\psi & \psi\\[2pt] \dfrac{d\bar\psi}{dx} & \dfrac{d\psi}{dx}\end{vmatrix} = 0. \qquad (6.6)$$

The determinant (6.6) has the limiting value $2i\alpha$ for $x \to +\infty$ 
and for $x \to -\infty$ 

$$2i\alpha\,(b\bar b - b'\bar b'),$$

hence 

$$b\bar b - b'\bar b' = 1. \qquad (6.7)$$

It follows from this that $b \neq 0$. On multiplying $\psi(x)$ by $1/b$ 
we have a solution $\psi$ whose asymptotic behaviour is described 
by the equations 

$$\psi(x) \sim e^{i\alpha x} + a'e^{-i\alpha x} \quad\text{for } x \to -\infty, \qquad \psi(x) \sim a\,e^{i\alpha x} \quad\text{for } x \to +\infty \qquad (6.8)$$

where $a = 1/b$, $a' = b'/b$ ; (6.7) is now 

$$a\bar a + a'\bar a' = 1.$$

A particle of definite energy runs against the potential energy 
hill from the left, i.e. from $x = -\infty$. Whereas in classical 
mechanics the particle certainly either gets over the hill or is thrown 
back according to whether its initial kinetic energy is greater or 
less than the maximum of $V(x)$, quantum mechanics states that 
there is a probability $|a|^2$ that it gets over and a probability 
$|a'|^2$ that it is thrown back. Furthermore, these probabilities are 
continuous functions of the energy of the particle ; the 
discontinuity of the classical theory is completely broken down. 
If we perform the experiment successively with a large number 
of particles we find that they are divided into two streams, 
in accordance with (6.8), proceeding in the positive and negative 
directions along the $x$-axis ; the relative intensities of these 
are given by 1 and $|a'|^2$ for $x \to -\infty$, respectively, while for 
$x \to +\infty$ there exists only the positive stream of intensity 
$|a|^2$. The relation $a\bar a + a'\bar a' = 1$ thus expresses the conservation of the 
number of particles and shows that we must consider the square 
$|a|^2$ of the absolute value of the amplitude $a$ as a relative intensity 
or probability. 
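
The conservation of the number of particles can be observed numerically for a particular hill. The sketch below is an added illustration under assumptions not found in the text: the potential is taken as $U(x) = 2e^{-x^2}$, the wave number as $\alpha = 1$, and the equation $\psi'' + (\alpha^2 - U)\psi = 0$ is integrated from right to left by the classical fourth-order Runge-Kutta rule, starting from the pure outgoing wave $e^{i\alpha x}$. The amplitudes $b$, $b'$ are then read off on the left and converted into $a = 1/b$, $a' = b'/b$. All names are hypothetical.

```python
import cmath
import math

def transmission_reflection(alpha, potential, x_max=8.0, steps=16000):
    """Return the amplitudes (a, a_prime) of the transmitted and reflected waves."""
    step = -2.0 * x_max / steps                 # negative step: from +x_max to -x_max

    def deriv(x, psi, dpsi):
        # psi'' = (U(x) - alpha^2) * psi
        return dpsi, (potential(x) - alpha ** 2) * psi

    x = x_max
    psi = cmath.exp(1j * alpha * x)             # outgoing wave on the right
    dpsi = 1j * alpha * psi
    for _ in range(steps):
        k1 = deriv(x, psi, dpsi)
        k2 = deriv(x + step / 2, psi + step / 2 * k1[0], dpsi + step / 2 * k1[1])
        k3 = deriv(x + step / 2, psi + step / 2 * k2[0], dpsi + step / 2 * k2[1])
        k4 = deriv(x + step, psi + step * k3[0], dpsi + step * k3[1])
        psi += step / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        dpsi += step / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        x += step
    # On the left psi ~ b*exp(i*alpha*x) + b'*exp(-i*alpha*x)
    b = 0.5 * (psi + dpsi / (1j * alpha)) * cmath.exp(-1j * alpha * x)
    b_prime = 0.5 * (psi - dpsi / (1j * alpha)) * cmath.exp(1j * alpha * x)
    return 1 / b, b_prime / b

def hill(x):
    """Assumed example potential U(x): a smooth hill whose integral converges."""
    return 2.0 * math.exp(-x * x)

a, a_prime = transmission_reflection(alpha=1.0, potential=hill)
total = abs(a) ** 2 + abs(a_prime) ** 2
print(abs(a) ** 2, abs(a_prime) ** 2, total)
```

Here $\alpha^2$ lies below the top of the hill, so a classical particle would certainly be thrown back, yet the transmitted intensity $|a|^2$ comes out different from zero.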

If the integral 

$$\frac{1}{\alpha}\int_{-\infty}^{+\infty}|U(x)|\,dx < 1$$

the solution $\psi$ is represented throughout the whole space by the 
formula (6.3). In perturbation theory one is usually satisfied 
with the first term. The theory of the familiar experiments 
of Rutherford, in which $\alpha$-particles are allowed to fly in a given 
direction with given momentum into and be deflected by the 
field of an atom, has been developed by Wentzel in a similar 
manner.*® The influence of the $\alpha$-particle on the atom is thereby 
neglected ; on taking it into account we are led to the theory 
of the experiments of Franck and Hertz, giving formulae for 
the dispersed particles specified according to their various 
discrete kinetic energies and their various directions. This 
calculation has been carried through for hydrogen by Born and 
Elsasser.^^ A very important application of this picture of 
corpuscular waves “ seeping ” through a potential hill has been 
made by G. Gamow and R. IV. Gurney and E. U. Condon to 
explain radioactive decay.** 

§ 7. The Conceptual Structure of Quantum Mechanics 

The fruitfulness of the theory has been amply established by 
the above applications and the examples given have served to 
illustrate its physical interpretation ; it now seems time to set 
forth its general abstract formulation. 

Consider a physical system of known constitution. Each
particular state, each individual case of such a system is represented
by a vector $\mathfrak{x}$ of modulus 1 in a unitary system space. Each
physical quantity associated with the system is represented by an
Hermitian form in this space. The fundamental question which
we put to the theory is not, as in classical physics, "What value
has this physical quantity in this particular case?" but rather
"What are the possible values of the physical quantity $A$, and what
is the probability that it assumes a definite one of these values in
a given case?" The answer to this question is: The probability
that $A$ assumes the value $\alpha$ is the value $E_\alpha(\mathfrak{x})$ of the characteristic
form $E_\alpha$ of $A$ associated with the value $\alpha$, where the vector $\mathfrak{x}$
represents the case in question and the quantity $A$ is represented by
the Hermitian form $A$ in the system space. The quantity represented
by $A$ is capable of assuming only those values $\alpha$ which
are characteristic values of the form $A$. In accordance with the
equations

$$|\mathfrak{x}|^2 = \sum_\alpha E_\alpha(\mathfrak{x}), \qquad A(\mathfrak{x}) = \sum_\alpha \alpha \cdot E_\alpha(\mathfrak{x})$$

the sum of the probabilities is 1 and the value $A(\mathfrak{x})$ of the form
$A$ is the mean value or expectation of the quantity $A$ in the state $\mathfrak{x}$.
Since all assertions concerning the probabilities in a given state
$\mathfrak{x}$ are numerically unaltered when $\mathfrak{x}$ is replaced by $\varepsilon\mathfrak{x}$, where $\varepsilon$
is an arbitrary complex number of modulus 1, we cannot distinguish
between these two cases. The pure case or state is
consequently more properly represented by the ray $\mathfrak{x}$ than by
the vector $\mathfrak{x}$, and we must therefore operate in the ray field in
system space rather than in the vector field.
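
A small numerical illustration of these statements may be helpful. It is only a sketch: the two-dimensional system space and the particular Hermitian form `A` below are illustrative assumptions, and the function names are hypothetical.

```python
import cmath
import math

# Illustrative Hermitian form A = [[0, 1], [1, 0]] with characteristic
# values +1 and -1 and the unit characteristic vectors listed below.
A = [[0, 1], [1, 0]]
S = 1 / math.sqrt(2)
SPECTRUM = {+1: (S, S), -1: (S, -S)}

def inner(u, v):
    """Unitary scalar product sum_i conj(u_i) v_i."""
    return sum(complex(ui).conjugate() * vi for ui, vi in zip(u, v))

def probabilities(x):
    """E_alpha(x) = |(e_alpha, x)|^2 for each characteristic value alpha."""
    return {alpha: abs(inner(e, x)) ** 2 for alpha, e in SPECTRUM.items()}

def form_value(x):
    """A(x) = sum_ik a_ik conj(x_i) x_k, the expectation in the state x."""
    total = sum(A[i][k] * complex(x[i]).conjugate() * x[k]
                for i in range(2) for k in range(2))
    return total.real

x = (0.6, 0.8)                                  # a vector of modulus 1
phase = cmath.exp(0.7j)                         # arbitrary number of modulus 1
x_rotated = tuple(phase * xi for xi in x)       # same ray, different vector
```

Here `probabilities(x)` sums to 1, its weighted sum over the values $\pm 1$ agrees with `form_value(x)`, and `x_rotated` yields the same probabilities as `x`.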

The significance of probabilities for experimental science is
that they determine the relative frequency of occurrence in a
series of repeated observations. According to classical physics it
is in principle possible to create conditions under which every
quantity associated with a given physical system assumes an
arbitrarily sharply defined value which is exactly reproducible
whenever these conditions are the same. Quantum physics
denies this possibility. We illustrate this by the example of
directional quantization. We know conditions under which we
can guarantee with practical certainty that the atoms of a
hydrogen gas are in the normal state. Let us therefore assume
that we can create conditions under which we can be certain
that the atoms under observation are in the quantum state $(n, l)$
with azimuthal quantum number $l = 1$ and energy $E$. A
certain quantity $L_z$ which can, under these conditions, assume
only the values $+1$, $0$, or $-1$ is associated with each direction
$z$ in space. Stern and Gerlach have shown us how to sharpen
these conditions so that $L_z$ takes on a definite one of these values,
say $L_z = +1$. According to the theory the utmost limit of
precision is then reached. If $x$ is another direction in space,
then under these conditions which determine $L_z$ and $E$ only the
relative probability that the quantity $L_x$ assumes any one of the
values $+1$, $0$, $-1$ can be given. Why is it impossible to go
further and insure conditions under which in addition $L_x$ takes
on a definite one of the values, say 0, with certainty? Because
the "measurement" of $L_x$, which is accomplished by separating
the atoms into three classes $L_x = +1, 0, -1$, is only possible
by creating conditions which destroy the homogeneity already
existing with respect to $L_z$. Polarization of photons is obviously
somewhat analogous to directional quantization of atoms. The
conditions for the production of a monochromatic beam of light
in a definite direction determine the energy and momentum of
the photons. To each orientation $s$ of a Nicol prism corresponds
a definite quantity $A_s$ which is capable of assuming only
the values $\pm 1$; if $A_s = +1$ the light goes through and if
$A_s = -1$ it does not. With the aid of such a prism we separate
out the photons for which $A_s = +1$ without disturbing their
energy and momentum. The utmost limit of precision is then
reached; a monochromatic pencil of polarized light is the most
homogeneous light possible. If we now place a second Nicol
of orientation $\sigma$ in the path of this beam, then naturally only
those photons which have $A_\sigma = +1$ can pass through. But
the light which we thus obtain is of the same constitution as
if the first Nicol of orientation $s$ were not used at all; the
condition that all the photons have $A_s = +1$ is obviously destroyed
by the second Nicol.

Natural science is of a constructive character. The concepts
with which it deals are not qualities or attributes which can
be obtained from the objective world by direct cognition. They
can only be determined by an indirect methodology, by observing
their reaction with other bodies, and their implicit definition is
consequently conditioned by definite laws of nature governing
reactions. Consider, for example, the introduction of the
Galilean concept of mass, which essentially amounts to the
following indirect definition: "Every body possesses a momentum,
that is, a vector $m\mathfrak{v}$ having the same direction as its
velocity $\mathfrak{v}$; the scalar factor $m$ is called its mass. The momentum
of a closed system is conserved, that is, the sum of the
momenta of a number of reacting bodies is the same before
the reaction as after it." On applying this law to the observed
collision phenomena data are obtainable which allow a determination
of the relative masses of the various bodies. But
scientists have long held the opinion that such constructive
concepts were nevertheless intrinsic attributes of the "Ding an
sich," even when the manipulations necessary for their determination
were not carried out. In quantum theory we are confronted
with a fundamental limitation to this metaphysical standpoint.

We have already seen, toward the beginning of this chapter,
that a co-ordinate $x$ and its associated momentum $p$ stand in
a peculiar relationship to one another: the precise determination
of either one of these quantities precludes the precise
determination of the other. In the state represented by the
wave function $\psi(x)$ $\left[\int_{-\infty}^{+\infty} \bar\psi\psi\,dx = 1\right]$ the mean values $x_0 = \langle x \rangle$ and
$p_0 = \langle p \rangle$ are given by

$$\int_{-\infty}^{+\infty} x\,\bar\psi(x)\,\psi(x)\,dx \quad\text{and}\quad \frac{h}{i}\int_{-\infty}^{+\infty} \bar\psi\,\frac{d\psi}{dx}\,dx.$$

No loss of generality is incurred by taking these mean values
as zero; the first can be made to vanish by replacing $x$ by
$x - x_0$ or $\psi(x)$ by $\psi(x + x_0)$ and the second by replacing $\psi(x)$
by $e(-p_0 x/h) \cdot \psi(x)$. The mean values $(\Delta x)^2 = \langle (x - x_0)^2 \rangle$,
$(\Delta p)^2 = \langle (p - p_0)^2 \rangle$ are then given by

$$(\Delta x)^2 = \int_{-\infty}^{+\infty} x^2\,\bar\psi(x)\,\psi(x)\,dx, \qquad (\Delta p)^2 = h^2 \int_{-\infty}^{+\infty} \frac{d\bar\psi}{dx}\,\frac{d\psi}{dx}\,dx.$$

From these expressions the general inequality

$$\Delta p \cdot \Delta x \geq \tfrac{1}{2} h$$

can readily be obtained (I am indebted to W. Pauli for this
remark); the less the uncertainty in $x$, the greater the uncertainty
in $p$, and conversely.*
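
The inequality between $\Delta p$ and $\Delta x$ can be checked numerically for particular wave functions. The sketch below assumes real wave functions with vanishing mean values, sets the constant $h$ of the text equal to 1, and replaces the integrals by midpoint sums; the function names and the two trial functions are illustrative choices, not taken from the text.

```python
import math

H = 1.0  # the constant h of the text, set to 1 in this sketch

def gaussian(width):
    """Real normalized Gaussian wave function whose Delta x equals width."""
    norm = (2 * math.pi * width ** 2) ** -0.25
    return lambda x: norm * math.exp(-x * x / (4 * width ** 2))

def sech_packet(x):
    """Another real normalized trial function, 1 / (sqrt(2) cosh x)."""
    return 1 / (math.sqrt(2) * math.cosh(x))

def uncertainties(psi, lo=-40.0, hi=40.0, steps=80000):
    """Midpoint sums for (Delta x, Delta p) of a real psi with zero means."""
    dx = (hi - lo) / steps
    var_x = var_p = 0.0
    for j in range(steps):
        x = lo + (j + 0.5) * dx
        dpsi = (psi(x + dx / 2) - psi(x - dx / 2)) / dx   # central difference
        var_x += x * x * psi(x) ** 2 * dx                  # integral of x^2 psi^2
        var_p += H ** 2 * dpsi ** 2 * dx                   # h^2 integral of psi'^2
    return math.sqrt(var_x), math.sqrt(var_p)

def product(psi):
    delta_x, delta_p = uncertainties(psi)
    return delta_x * delta_p
```

For the Gaussian the product should come out close to $\tfrac{1}{2}h$ whatever the width (narrowing the packet in $x$ widens it in $p$), while the second trial function gives a somewhat larger product.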

In general the conditions under which an experiment is
performed will not even guarantee that all the individuals
constituting the system under observation are in the same "state,"
as represented in the quantum theory by a ray in system space.
This is, for example, the case when we only take care that all
the atoms are in the quantum state $(n, l)$ without undertaking
to separate them with respect to $m$ by means of the Stern-Gerlach
effect. In order to apply quantum mechanics it is
therefore necessary to set up a criterion which will enable us to
determine whether the given conditions are sufficient to insure
such a "pure state." We say that the conditions $\mathfrak{C}'$ effect
a greater homogeneity than the conditions $\mathfrak{C}$ if (1) every quantity
which has a sharp, reproducible value under $\mathfrak{C}$ has the same definite
value under $\mathfrak{C}'$ and if (2) there exists a quantity which is strictly
determinate under $\mathfrak{C}'$ but not under $\mathfrak{C}$. The desired criterion
is obviously this: The conditions $\mathfrak{C}$ guarantee a pure state if it
is impossible to produce a further increase in homogeneity. (This
maximum of homogeneity was obtained in classical physics
only when all quantities associated with the system had definite
values.)

* Cf. Appendix 1 at the end of the book.

In the pure state represented by the vector $\mathfrak{a} = (a_i)$, a quantity
$Q$ represented by the Hermitian matrix $Q = \|q_{ik}\|$ has the
expectation or mean value

$$\langle Q \rangle = \sum_{i,k} a_k \bar a_i\, q_{ik}.$$

The numbers

$$a_{ik} = a_i \bar a_k \tag{7.1}$$

are the components of a positive definite Hermitian form $A$ of
trace 1, i.e.

$$A(\mathfrak{x}) = |(\mathfrak{a}\mathfrak{x})|^2 = \sum_i a_i \bar x_i \cdot \sum_i \bar a_i x_i.$$

(Positive definite is to be understood here in the weakened
sense $A(\mathfrak{x}) \geq 0$.) It is to be noted that $\langle Q \rangle$ depends linearly
and homogeneously on the quantity $Q = \|q_{ik}\|$ under consideration:

$$\langle Q \rangle = \operatorname{tr}(AQ). \tag{7.2}$$

If a statistical aggregate $A$ is created by subjecting a large number
of individuals of the physical system under observation to the
conditions $\mathfrak{C}$, then the mean value of a physical quantity $Q$
will be given by (7.2), where $A$ is a certain positive definite
Hermitian form of trace 1 which is characteristic for the
aggregate, even if the conditions $\mathfrak{C}$ do not guarantee maximum
homogeneity. The reason for this is that (7.2) is still correct
if we mix statistical aggregates, each of which does possess
maximum homogeneity, in any proportions; any statistical
case may indeed be considered as a mixture of pure states.
As J. v. Neumann has remarked, this formula (7.2) can be derived
from the simple axioms:

1. If $P$, $Q$ are physical quantities and $\lambda$ a real number, then

$$\langle \lambda P \rangle = \lambda \langle P \rangle, \qquad \langle P + Q \rangle = \langle P \rangle + \langle Q \rangle.$$

2. If the quantity $Q$ is capable of assuming only positive
values (i.e. if the form $Q$ is positive definite), then $\langle Q \rangle \geq 0$.

3. If $Q$ is a pure number, i.e. if it is independent of all
physical conditions, then $\langle Q \rangle = Q$.

Assuming not only that any physical quantity $Q$ is represented
by an Hermitian form, but also that conversely any
Hermitian form represents some quantity associated with the
system, it follows from (1) that

$$\langle Q \rangle = \sum_{i,k} a_{ki}\, q_{ik}$$

where the coefficients $a_{ki}$ are independent of $Q$. (We shall
return to this assumption in Chap. IV, § 9.) The matrix
$A = \|a_{ik}\|$ must be Hermitian since $\langle Q \rangle$ is always real. On
bringing $A$ into the normal form $\sum_i a_i \bar x_i x_i$, (2) requires for the special
Hermitian forms of the type $Q = \sum_i q_i \bar x_i x_i$ that $\sum_i a_i q_i \geq 0$ for
arbitrary non-negative values $q_i$; consequently $a_i \geq 0$ and $A$
is positive definite.

The probability that in the statistical aggregate $A$ the quantity
$Q$ assumes the value $\kappa$ is

$$w_\kappa = \operatorname{tr}(A E_\kappa) \tag{7.3}$$

where $E_\kappa$ is the idempotent form associated with the characteristic
number $\kappa$.
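
The formulae (7.2) and (7.3) can be tried out on a small mixture. The following sketch assumes a two-dimensional system space; the two pure states, the mixing weights and the quantity $Q$ with values $\pm 1$ are illustrative choices, and the helper names are hypothetical.

```python
import math

def pure_form(a):
    """Matrix a_ik = a_i conj(a_k) of the pure state a, as in (7.1)."""
    return [[ai * complex(ak).conjugate() for ak in a] for ai in a]

def mixture(weights, forms):
    """Weighted sum of statistical aggregates (weights positive, sum 1)."""
    n = len(forms[0])
    return [[sum(w * F[i][k] for w, F in zip(weights, forms))
             for k in range(n)] for i in range(n)]

def trace_of_product(A, B):
    """tr(AB) = sum_ik a_ik b_ki."""
    n = len(A)
    return sum(A[i][k] * B[k][i] for i in range(n) for k in range(n)).real

S = 1 / math.sqrt(2)
state_1 = (1.0, 0.0)            # illustrative pure states
state_2 = (S, S)
A = mixture([0.25, 0.75], [pure_form(state_1), pure_form(state_2)])

Q = [[1, 0], [0, -1]]           # quantity with characteristic values +1, -1
E_PLUS = [[1, 0], [0, 0]]       # idempotent form for the value +1
E_MINUS = [[0, 0], [0, 1]]      # idempotent form for the value -1

w_plus = trace_of_product(A, E_PLUS)     # formula (7.3)
w_minus = trace_of_product(A, E_MINUS)
mean_Q = trace_of_product(A, Q)          # formula (7.2)
```

The probabilities `w_plus` and `w_minus` add up to the trace of `A`, i.e. to 1, and `mean_Q` equals `w_plus - w_minus`, which is also the weighted mean of the expectations in the two pure states.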

We can also distinguish "pure states" among general statistical
aggregates, "mixed states," by the fact that they cannot
be obtained by mixing two or more different statistical aggregates.
This corresponds to the theorem that an Hermitian matrix $A$ of
the form (7.1) is not expressible as the sum $B + C$ of two positive
definite Hermitian forms $B$ and $C$ which are not merely multiples
of $A$. This can be readily proved on taking the vector $\mathfrak{a} = (a_i)$
as one of the co-ordinate axes in system space. The positive
definite Hermitian forms $A$ with unit trace, i.e. the statistical
aggregates, constitute a convex region $\mathfrak{G}$ in the sense that with
$A$ and $B$ their "centre of mass" $\lambda A + \mu B$ ($\lambda$, $\mu$ arbitrary positive
numbers whose sum is unity) belongs also to $\mathfrak{G}$. A point of $\mathfrak{G}$
which cannot be considered as such a centre of mass of two
points of $\mathfrak{G}$ distinct from the point in question is called, following
Minkowski, an "extreme point." $\mathfrak{G}$ is the "convex core" of
the class $\mathfrak{e}$ of all extreme points, i.e. it is the smallest convex
domain which includes all the points of $\mathfrak{e}$. We cannot dispense
with a single extreme point of $\mathfrak{G}$; if we leave out but a single
point of $\mathfrak{e}$ the entire convex core shrinks together. We may
accordingly characterize the pure states as the "extremes" among
all the possible statistical aggregates.

It is often convenient to dispense with the normalization
$\operatorname{tr} A = 1$; (7.3) then gives the relative rather than the absolute
probabilities. The simplest statistical aggregate is that one
characterized by the unit Hermitian form with matrix 1; it
represents total ignorance. In thermo-dynamics the important
role is played by the canonical aggregate $A = e^{-H/k\theta}$; $H$ is here
the Hermitian form which represents the energy, $k$ the Boltzmann
constant and the number $\theta$ the temperature.


§ 8. The Dynamical Law. Transition Probabilities 

Having considered the general probability laws of the quantum
theory, we now turn to the dynamical law governing the change
in the state $\mathfrak{x}$ of a physical system during an interval $dt$ of time.
The dynamical law states that this change is effected by
the infinitesimal unitary operator $-\dfrac{i\,dt}{h} \cdot H$, where $H$ is the
Hermitian form which represents the energy:

$$\frac{h}{i}\frac{d\mathfrak{x}}{dt} + H\mathfrak{x} = 0. \tag{8.1}$$

The peculiar significance of the energy in quantum mechanics
is due to its appearance in the dynamical law. We also consider
this law as a fundamental axiom of quantum theory of universal
validity. For the matrix $X$:

$$x_{ik} = x_i \bar x_k,$$

which characterizes a statistical aggregate of the pure state
described by the vector $\mathfrak{x} = (x_i)$ [cf. eq. (7.2)], we obtain the
equation

$$\frac{h}{i}\frac{dX}{dt} = XH - HX \tag{8.2}$$

on applying (8.1) and taking into account the fact that $H$ is
Hermitian. This same equation also governs the change in
time of a statistical aggregate $X$ for a mixed state.

For the integration of (8.1) it is convenient to choose as our
co-ordinate system the characteristic vectors of $H$; the corresponding
characteristic numbers $E_n$ are the energy levels. We
call this particular system the Heisenberg co-ordinate
system, as Heisenberg tacitly employed it in his fundamental
paper on quantum mechanics. This Heisenberg co-ordinate
system is in general not uniquely determined; the essential
point is the decomposition of the system space $\mathfrak{R}$ into the
characteristic sub-spaces $\mathfrak{R}' = \mathfrak{R}(E')$, $\mathfrak{R}'' = \mathfrak{R}(E'')$, ... associated
with the various characteristic numbers $E'$, $E''$, ....
The states represented by vectors $\mathfrak{x}$ in such a characteristic
space are called quantum or stationary states; in them the
energy has a sharply defined value. The cases in which $H$
possesses only discrete characteristic numbers include "conditionally
periodic motion," the only ones for which the older
quantum theory could be formulated. The nomenclature and
symbolism employed in the following is adapted to discrete
characteristic spectra, but this by no means precludes the possibility
that the spectrum is entirely or partly continuous. Equation
(8.1) becomes, on resolving it into components with respect to
Heisenberg's co-ordinate system,

$$\frac{h}{i}\frac{dx_n}{dt} + E_n x_n = 0$$

and has as solution

$$x_n(t) = x_n \cdot e(-\nu_n t) \qquad (E_n = h\nu_n). \tag{8.3}$$

This is an explicit formulation of the unitary transformation
$\mathfrak{x} \to \mathfrak{x}(t) = U(t)\mathfrak{x}$ which the state vector $\mathfrak{x}$ undergoes in time $t$.
Since $|x_n(t)|^2$ is constant, the probabilities for the various energy
values do not change in the course of time. The finite law

$$X(t) = U(t)\,X\,U^{-1}(t) \tag{8.4}$$

for the dependence of the statistical state $X(t)$ on the time $t$
is fully equivalent to the differential law (8.2).

The mean value $q = q(t)$ of the physical quantity represented
by the fixed Hermitian operator $Q$:

$$q(t) = \operatorname{tr}[X(t) \cdot Q]$$

can, on taking into account the symmetry properties of the
trace, be written also in the form

$$q(t) = \operatorname{tr}[X \cdot Q(t)]$$

where

$$Q(t) = U^{-1}(t)\,Q\,U(t). \tag{8.5}$$


Consequently the situation can be described either by considering
$Q$ as fixed for all time and the statistical state $X(t)$ as
varying with the time in accordance with the law (8.4), which
is the fundamental stand taken by quantum mechanics,
or by taking the initial state $X$ as representing the state of
the system for all time and allowing the operator $Q(t)$ representing
the quantity $Q$ to vary with time in accordance with the law
(8.5). This latter interpretation lends itself to comparison with
classical mechanics. (8.5) is equivalent to the differential law

$$\frac{h}{i}\frac{dQ}{dt} = HQ - QH, \tag{8.6}$$

for in virtue of (8.2) and (8.6)

$$\frac{dq}{dt} = \operatorname{tr}\left(\frac{dX}{dt} \cdot Q\right) = \operatorname{tr}\left(X \cdot \frac{dQ}{dt}\right).$$

In particular, the quantity $Q$ is constant in time, i.e. the probabilities
associated with it do not change in course of time, if the
Hermitian form $Q$ which represents it commutes with $H$.
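
The equivalence of the two descriptions can be verified numerically in a small case. The sketch below assumes a two-level system in Heisenberg's co-ordinate system, sets $h = 1$, and takes $e(x)$ to mean $\exp(ix)$; the energy levels, the state and the two quantities are illustrative, and the helper names are hypothetical.

```python
import cmath

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][l] * B[l][k] for l in range(n)) for k in range(n)]
            for i in range(n)]

def dagger(A):
    """Hermitian conjugate; for the unitary U(t) this is the inverse."""
    n = len(A)
    return [[complex(A[k][i]).conjugate() for k in range(n)] for i in range(n)]

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

def evolution(levels, t, h=1.0):
    """U(t) in Heisenberg's co-ordinate system: diagonal entries e(-nu_n t)."""
    n = len(levels)
    return [[cmath.exp(-1j * levels[i] * t / h) if i == k else 0j
             for k in range(n)] for i in range(n)]

def mean_with_moving_state(X, Q, levels, t):
    """q(t) = tr[X(t) Q] with X(t) = U X U^-1, law (8.4)."""
    U = evolution(levels, t)
    return trace(matmul(matmul(matmul(U, X), dagger(U)), Q)).real

def mean_with_moving_quantity(X, Q, levels, t):
    """q(t) = tr[X Q(t)] with Q(t) = U^-1 Q U, law (8.5)."""
    U = evolution(levels, t)
    return trace(matmul(X, matmul(matmul(dagger(U), Q), U))).real

LEVELS = (1.0, 2.5)                          # illustrative energy levels
x = (0.6, 0.8)                               # pure state of modulus 1
X = [[xi * xk for xk in x] for xi in x]      # x_ik = x_i conj(x_k), real here
Q_MIXING = [[0, 1], [1, 0]]                  # does not commute with H
Q_DIAGONAL = [[1, 0], [0, -1]]               # commutes with H
```

Both functions should return the same mean value at any time; for `Q_DIAGONAL` that value does not depend on the time at all, while for `Q_MIXING` it oscillates with the frequency $\nu_1 - \nu_2$.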

In Heisenberg's co-ordinate system equation (8.5) becomes

$$q_{mn}(t) = q_{mn} \cdot e\big((\nu_m - \nu_n)t\big). \tag{8.7}$$

The matrix $Q(t)$ is thus expressed in terms of components performing
simple oscillations with frequencies $\nu_m - \nu_n$. The
corresponding amplitude is $q_{mn}$. On going over from the $m$th
to the $n$th stationary state the system loses an amount $h(\nu_m - \nu_n)$
of energy; if this energy is radiated as light, its frequency
is given by

$$\nu_{mn} = \nu_m - \nu_n. \tag{8.8}$$

Classical mechanics collects together all the transitions from
a fixed level $m$ to all possible levels $n = 1, 2, \ldots$ into a single
state of motion, the motion of the system in the $m$th quantum
state, whose harmonic components have the corresponding
transition frequencies $\nu_{m1}, \nu_{m2}, \ldots$. For any quantity $A$ it
therefore associates a constant amplitude $a_{mn}$ with the transition
$m \to n$. But in classical mechanics (for systems with one degree
of freedom) we have

$$\nu_{mn} = k \cdot \omega(m), \qquad k = m - n,$$

instead of equation (8.8). On multiplying the two Fourier
series $A$, $B$

$$\sum_k a_k\, e^{ik\omega t} \quad\text{and}\quad \sum_k b_k\, e^{ik\omega t}$$

we obtain the Fourier series $C$ with coefficients

$$c_k = \sum a_r b_s \qquad (r + s = k).$$

Accordingly classical mechanics associates with the quantity
$C = AB$ the amplitudes

$$c_{mn} = \sum a_{m,\,m-r} \cdot b_{m,\,m-s} \qquad (r + s = m - n), \tag{8.9}$$

whereas quantum mechanics assigns to it the amplitudes

$$c_{mn} = \sum_l a_{ml}\, b_{ln} = \sum_r a_{m,\,m-r} \cdot b_{m-r,\,n}. \tag{8.10}$$

The difference between these two results lies in the fact that in
(8.9) both factors $a$, $b$ have the first index $m$ in common, whereas
in (8.10) the first index of $b$ is the same as the last index of $a$.
This is in exact analogy with the difference between the "classical"
and the correct Ritz-Rydberg combination principle. This was
Heisenberg's starting-point; the correct combination principle
indicates the pertinent fact that the rule (8.9) for the multiplication
of amplitudes must be replaced by (8.10). Admittedly
such multiplication is not commutative, and it collects together
amplitudes which the older model assigned to different orbits.
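
That the quantum rule (8.10) is not commutative can be seen in a short computation. The sketch below uses, purely as an illustration, the amplitudes of the co-ordinate $q$ and momentum $p$ of the oscillator treated in the examples of this section, in units $h = m = \omega = 1$, with the phases of $p$ chosen so that $pq - qp = h/i$. The truncation to `N` levels and all names are assumptions of the sketch.

```python
import math

N = 6  # number of oscillator levels kept; the true matrices are infinite

def zero_matrix():
    return [[0j] * N for _ in range(N)]

q = zero_matrix()
p = zero_matrix()
for n in range(N):
    if n >= 1:                       # amplitudes for the transition n -> n-1
        q[n][n - 1] = math.sqrt(n / 2)
        p[n][n - 1] = 1j * math.sqrt(n / 2)
    if n + 1 < N:                    # amplitudes for the transition n -> n+1
        q[n][n + 1] = math.sqrt((n + 1) / 2)
        p[n][n + 1] = -1j * math.sqrt((n + 1) / 2)

def quantum_product(a, b):
    """Rule (8.10): c_mn = sum_l a_ml b_ln."""
    return [[sum(a[m][l] * b[l][n] for l in range(N)) for n in range(N)]
            for m in range(N)]

pq = quantum_product(p, q)
qp = quantum_product(q, p)
commutator = [[pq[m][n] - qp[m][n] for n in range(N)] for m in range(N)]
```

The two products differ: `commutator` is diagonal with entries $1/i$ (in these units), except for the last diagonal entry, which is spoiled by the truncation.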

We denote $|a_{mn}|^2$ as the intensity of the quantity $A$ in the
transition $m \to n$. When multiple energy levels occur ("degeneracy")
only the sum $\sum |a_{mn}|^2$, extended over all indices
$m$ for which $E_m = E'$ and all indices $n$ for which $E_n = E''$,
has an invariantive significance; in such a case this sum is
taken as the intensity of $A$ in the transition $E' \to E''$. If $A_{E'E''}$
is that portion of $A$ in which $\mathfrak{R}(E')$ intersects $\mathfrak{R}(E'')$, the sum
defined above is the trace of $A_{E'E''} A_{E'E''}^{\dagger}$, the product of this
portion with its Hermitian conjugate.

Consider an atom with one or more electrons and let $\mathfrak{r}$ be
the vector from the nucleus to a representative electron. Then
$\mathfrak{q} = e\mathfrak{r}$, or in case there is more than one electron the sum
$\mathfrak{q} = e\sum\mathfrak{r}$ extended over the various electrons, is the electric
dipole moment of the atom. In classical electrodynamics the
intensity of the light of frequency $\nu$ emitted by the atom is calculated
from the amplitude $\mathfrak{q}(\nu)$ of the harmonic components of $\mathfrak{q}$ with
the same frequency $\nu$ in the following manner.† The rate at
which energy flows through a surface element $do$ at the point $P$,
whose distance from the atom at $O$ is large compared with the
wave-length, is given by


where $\mathfrak{q}_\perp$ is the component of $\mathfrak{q}$ perpendicular to $OP$ and $d\omega$ is
the solid angle subtended at $O$ by $do$. We have further assumed
that the wave-length under consideration is large compared with
the radius of the atom. Since each photon of frequency $\nu$
carries with it energy $h\nu$, we postulate that this law is to be
taken over into quantum theory as follows: the probability
that an atom in state $n$ goes over into state $n'$ in unit time and
emits a photon of frequency $\nu$, whose direction lies within the
solid angle $d\omega$, is given by

''''■•I’ ■ '*■"> 

We thus arrive at a definite rule for the calculation of the intensities
of the lines emitted by the atom. The fact that we can now make
such a prediction indicates a distinct superiority of the new
theory over the old. In particular, the transition $n \to n'$ does
not occur if the corresponding coefficient in the Hermitian form
for $\mathfrak{q}$ is zero. This constitutes the general selection rule. The
connection between the state of polarization of the emitted
light and the direction of oscillation of the electric moment is
also carried over into quantum theory. But a real derivation
of our intensity rule can naturally only be obtained by considering
the question of interaction between the atom and the
ether; see § 13.

† By this we mean that the terms $\mathfrak{q}(\nu)\,e(\nu t) + \bar{\mathfrak{q}}(\nu)\,e(-\nu t)$ occur in the
harmonic analysis of $\mathfrak{q}$.


Examples: 1. The Oscillator.

The Hermitian form

$$\int_{-\infty}^{+\infty} x\,\bar\psi(x)\,\psi(x)\,dx,$$

representing the co-ordinate $x$ of the oscillating particle has,
as we have already found [(3.11)], the coefficients

$$q_{n,\,n-1} = \sqrt{\frac{hn}{2m\omega}}, \qquad q_{n,\,n+1} = \sqrt{\frac{h(n+1)}{2m\omega}}, \qquad q_{nn'} = 0 \ \text{ if } n' \neq n \pm 1 \tag{8.12}$$

with respect to Heisenberg's co-ordinate system, in which the
energy is referred to its principal axes. We thus obtain the
selection rule $n \to n \pm 1$: the quantum number $n$ can only change
by $\pm 1$, the oscillator then absorbing or emitting a photon of frequency
$\nu = \omega$ and energy $h\omega$, in accordance with (3.10). The
selection rule makes it clear why no higher harmonics are excited
in the simple oscillator. We have also found that the
matrix $\|p_{nn'}\|$, which represents the linear momentum in Heisenberg's
co-ordinate system, is given by (3.12)

$$p_{n,\,n-1} = -\frac{1}{i}\sqrt{\frac{hm\omega n}{2}}, \qquad p_{n,\,n+1} = \frac{1}{i}\sqrt{\frac{hm\omega(n+1)}{2}}, \qquad p_{nn'} = 0 \ \text{ for } n' \neq n \pm 1. \tag{8.13}$$

2. Electron in spherically symmetric field.

The result (4.7) for surface harmonics yields the selection rule

$$l \to l \pm 1 \tag{8.14}$$

for the azimuthal quantum number $l$; for $l = 0$ only the transition
$0 \to 1$ is possible. On introducing the magnetic quantum number
$m$ as in § 4, the characteristic functions depend on the
meridian angle $\phi$ about the $z$-axis only in the multiplicative
factor $e^{im\phi}$; here

$$x \pm iy = r \sin\theta \cdot e^{\pm i\phi}, \qquad z = r \cos\theta.$$

In order to obtain the dependence of the matrices $q_x + iq_y$,
$q_x - iq_y$, $q_z$ on the transition $m \to m'$ we must evaluate the
integral

$$\int_0^{2\pi} e(\alpha\phi)\, e(-m\phi)\, e(m'\phi)\, d\phi,$$

where $\alpha = 1, -1, 0$, respectively. The integral vanishes
unless $m' + \alpha = m$. The only components of $q_x + iq_y$ which do
not vanish are those corresponding to the transitions $m \to m - 1$,
in which the magnetic quantum number decreases by 1; for
$q_x - iq_y$, $m \to m + 1$; for $q_z$, $m \to m$.
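
The vanishing of this integral can be checked directly. The sketch below takes $e(x)$ to mean $\exp(ix)$ and replaces the integral by a sum over equally spaced angles, which is adequate for integer exponents; the function names and the range of $m'$ tried are illustrative assumptions.

```python
import cmath
import math

def phi_integral(alpha, m, m_prime, steps=720):
    """Sum approximating the integral of e(alpha phi) e(-m phi) e(m' phi)
    over 0 <= phi < 2 pi."""
    d_phi = 2 * math.pi / steps
    exponent = alpha - m + m_prime
    return sum(cmath.exp(1j * exponent * j * d_phi) for j in range(steps)) * d_phi

def allowed_transitions(alpha, m, l=2):
    """Values m' with |m'| <= l for which the integral does not vanish."""
    return [m_prime for m_prime in range(-l, l + 1)
            if abs(phi_integral(alpha, m, m_prime)) > 1e-9]
```

With `alpha = 1` (the component $q_x + iq_y$) only $m' = m - 1$ survives, with `alpha = -1` only $m' = m + 1$, and with `alpha = 0` only $m' = m$.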

This last selection rule cannot be obtained from the spectra
themselves as long as the terms corresponding to different
values of $m$ ($|m| \leq l$) coincide. But these terms are broken
up into their various components by a homogeneous magnetic
field in the direction of the $z$-axis (Zeeman effect). On "longitudinal"
observation of the light emitted in the $z$-direction we
find instead of the one line $(n, l) \to (n', l')$ several left- and right-
circularly polarized components, the former of which arise from
the transitions $m \to m - 1$ and the latter from $m \to m + 1$.
On "transverse" observation, e.g. along the $y$-axis, we find
two transverse linearly polarized lines arising from $m \to m \pm 1$,
and in addition a longitudinally (i.e. along the $z$-axis) polarized
line corresponding to the transition $m \to m$. (Polarization as
here used means the direction of oscillation of the electric dipole,
and therefore the direction of the electric field strength.)

In the term spectrum of the alkali elements, which is, however,
typical in this respect, even for the more complicated spectra
of the other elements, we distinguish between several series by
means of the letters $s, p, d, f, g, \ldots$. Each series consists of
infinitely many terms which we number in the direction of
increasing frequency by the integer $n$. It is found convenient
to let $n$ run from 1 on in the $s$-series, from 2 on in the $p$-series,
from 3 in the $d$-series, etc. The values of the terms $ns$, $np$,
$nd$, ... are then given by the "hydrogen-like" formula

$$-\frac{R}{(n + \kappa)^2},$$

in which $\kappa = \kappa_s, \kappa_p, \ldots$ is a correction term depending but
slightly on $n$, the numerical value of which but rarely exceeds
1/2 and is very close to 0 for high series ($f, g, \ldots$). Only terms
lying in neighbouring series combine to produce a line, i.e. an
$s$-term combines only with a $p$-term, $p$ only with $s$ and $d$, $d$ with
$p$ and $f$, etc. In particular, the transitions $np \to 1s$ give rise
to the principal series, which also appears in absorption, $nd \to 2p$
to the lines of the diffuse series, $ns \to 2p$ to the sharp series,
and $nf \to 3d$ to the Bergmann series.
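
The term formula and the combination rule can be put into a few lines of code. This is a sketch only: the Rydberg constant is taken in wave-number units (cm$^{-1}$, infinite nuclear mass), the correction $\kappa$ is left as a free parameter to be supplied by the user (no empirical values are assumed here), and the function names are hypothetical.

```python
RYDBERG = 109737.316   # Rydberg constant in cm^-1 (infinite nuclear mass)
SERIES = "spdfg"       # order of the series; neighbours may combine

def term_value(n, kappa=0.0):
    """Hydrogen-like term -R / (n + kappa)^2."""
    return -RYDBERG / (n + kappa) ** 2

def may_combine(series_a, series_b):
    """Only terms lying in neighbouring series combine to produce a line."""
    return abs(SERIES.index(series_a) - SERIES.index(series_b)) == 1

def line_wave_number(upper, lower):
    """Wave number of the line upper -> lower, or None if it is forbidden.

    Each argument is a triple (n, series letter, kappa).
    """
    (n_u, s_u, k_u), (n_l, s_l, k_l) = upper, lower
    if not may_combine(s_u, s_l):
        return None
    return term_value(n_u, k_u) - term_value(n_l, k_l)
```

With $\kappa = 0$ the call `line_wave_number((2, "p", 0.0), (1, "s", 0.0))` gives $\tfrac{3}{4}R$, while a combination such as $3d \to 1s$ is rejected.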

The alkalies are univalent, i.e. in chemical reactions only
one electron, the valence electron, plays a role; the others,
together with the nucleus, constitute an inert closed shell. It
is therefore reasonable to assume that the optical spectra of
the alkalies are caused by quantum jumps involving only this
valence electron, while the core remains in its normal state.
We have seen above that hydrogen in the normal state is represented
by a spherically symmetric wave function $\psi$; we
therefore assume, disregarding the reaction of the valence
electron on the core, that this feature of the core being "closed"
is to be expressed by ascribing spherical symmetry to it.* We
have then to deal with the problem of an electron in a spherically
symmetric field, which we have already discussed above. In
accordance with the empirical combination principle and the
theoretical selection rule for the azimuthal quantum number $l$,
the $s, p, d, f, \ldots$ terms are to be taken as having $l = 0, 1, 2, 3, \ldots$
respectively. $n$ then runs from $l + 1$ on in the series with
azimuthal quantum number $l$, as in hydrogen.**

* Why He and not H is the first closed atom is only to be understood as
the result of a profound modification of wave mechanics; see Chap. IV.

** Concerning the introduction of the "true quantum number" for
elements other than hydrogen, see Chap. IV, § 10.

§ 9. Perturbation Theory

The problem with which perturbation theory is concerned is
the following: Let the energy $\mathbf{H}$ consist of two terms, $\mathbf{H} = H + \varepsilon W$,
the second of which, the perturbation term $\varepsilon W$, is small compared
with the first; this we express by the "infinitesimal" numerical
constant $\varepsilon$, of which powers higher than the first are to be
neglected. Assume that the quantum problem for the "unperturbed
system" with energy $H$ has already been solved, so
that the Hermitian form $H$ has already been brought into
normal (diagonal) form, and let $\mathfrak{R}', \mathfrak{R}'', \ldots$ be the characteristic
spaces of $H$ with characteristic numbers $E', E'', \ldots$. The
problem is to find the solution of the equations for the "perturbed
system" with energy $\mathbf{H}$.

In order to illustrate the typical difference between degenerate
and non-degenerate systems we first consider the system space as
2- instead of $\infty$-dimensional; then

$$H = \begin{Vmatrix} E_1 & 0 \\ 0 & E_2 \end{Vmatrix}.$$

If $E_1 \neq E_2$ the unitary transformation which brings the perturbed
form $\mathbf{H} = H + \varepsilon W$ into
diagonal form differs from the identity only by terms of order $\varepsilon$.
Consequently the probabilities $|x_1|^2$, $|x_2|^2$ that in the pure state $\mathfrak{x}$
$H$ has the values $E_1$, $E_2$ will change only by amounts of the
same relative order $\varepsilon$; they remain constant to the same approximation
with which $\varepsilon W$ may be neglected in comparison
with $H$. But the situation is quite different for degenerate
systems, for which $E_1 = E_2 = E$, for the principal axes of $H$
are then indeterminate and this arbitrariness is expressed in
the "instability" of the system under the influence of a perturbation.
We set up that normal co-ordinate system $\mathfrak{e}_1'$, $\mathfrak{e}_2'$
in which $W$ assumes the diagonal form; the co-ordinate vectors
are then also characteristic vectors of $H$, since $E_1 = E_2$. But
these vectors can obviously differ arbitrarily from the original
co-ordinate vectors $\mathfrak{e}_1$, $\mathfrak{e}_2$, whereas the energies $h\nu_1'$, $h\nu_2'$ can only
differ from $E$ by a term of order $\varepsilon$. On returning to the original
co-ordinate system we have

$$x_1 = a_{11} \cdot e(-\nu_1' t) + a_{12} \cdot e(-\nu_2' t),$$
$$x_2 = a_{21} \cdot e(-\nu_1' t) + a_{22} \cdot e(-\nu_2' t),$$

where $\mathfrak{a}_1 = (a_{11}, a_{21})$, $\mathfrak{a}_2 = (a_{12}, a_{22})$ are two mutually perpendicular
vectors whose directions coincide with those of $\mathfrak{e}_1'$, $\mathfrak{e}_2'$.
The probabilities $|x_1|^2$, $|x_2|^2$ for the two states $\mathfrak{e}_1$, $\mathfrak{e}_2$ vary periodically in
time with the small beat frequency $\nu_1' - \nu_2'$ (resonance between
states $\mathfrak{e}_1$, $\mathfrak{e}_2$). Quantum states with the same energy are therefore
in resonance with one another. The magnitudes of the components
of $\mathfrak{x}$ in the characteristic spaces $\mathfrak{R}', \mathfrak{R}'', \ldots$, i.e. the probabilities
for the various numerically different values of $H$, remain approximately
constant under a small perturbation, but this is
not the case for the absolute values $|x_n|$ of the individual components
$x_n$ resolved along the axes of an arbitrary Heisenberg
co-ordinate system of the unperturbed system.
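
The resonance between two states of equal energy can be made explicit in the simplest case. The sketch below assumes $E_1 = E_2 = E$ and a perturbation $W$ with vanishing diagonal and real off-diagonal element $w$, so that $\mathfrak{e}_1', \mathfrak{e}_2'$ are $(1, \pm 1)/\sqrt{2}$ and $h\nu_{1,2}' = E \pm \varepsilon w$; it takes $e(x)$ to mean $\exp(ix)$ and starts the system in the state $\mathfrak{e}_1$. These choices and the function name are illustrative.

```python
import cmath
import math

def resonance_probabilities(t, eps_w, energy=0.0, h=1.0):
    """Return (|x_1|^2, |x_2|^2) at time t for a system started in e_1.

    eps_w stands for the product eps * w of the sketch's perturbation.
    """
    nu_1 = (energy + eps_w) / h
    nu_2 = (energy - eps_w) / h
    # e_1 = (e_1' + e_2') / sqrt(2) gives a_11 = a_12 = a_21 = 1/2, a_22 = -1/2.
    x_1 = 0.5 * cmath.exp(-1j * nu_1 * t) + 0.5 * cmath.exp(-1j * nu_2 * t)
    x_2 = 0.5 * cmath.exp(-1j * nu_1 * t) - 0.5 * cmath.exp(-1j * nu_2 * t)
    return abs(x_1) ** 2, abs(x_2) ** 2

def transfer_time(eps_w, h=1.0):
    """Time pi / (nu_1' - nu_2') after which the system is found in e_2."""
    return math.pi * h / (2 * eps_w)
```

However small the perturbation, the probability passes completely from $\mathfrak{e}_1$ to $\mathfrak{e}_2$ and back; a smaller `eps_w` only lengthens the period of the beat.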

In accordance with the foregoing we can formulate the
perturbation problem in two forms: I. Determine the change,
due to the perturbation, in those states in which the energy
$H$ of the unperturbed system is determinate. This formulation
has a sound physical interpretation if we consider the perturbation
as acting during a time interval $t_1, t_2$. We then find how the
probabilities for the various quantum states change under the
influence of the perturbation. II. Determine the quantum
states and energy levels of the perturbed system, i.e. the
characteristic values and characteristic spaces of the perturbed
energy $\mathbf{H}$. We ask in
particular how the terms are broken up and displaced under the
perturbation. We consider II first.
We first decompose the Hermitian form $W$ into two parts:
$W_0 + V$. To the first belong those portions of $W$ in which
a characteristic space $\mathfrak{R}', \mathfrak{R}'', \ldots$ of $H$ intersects itself, and to
$V$ those in which two different characteristic spaces intersect.
If the characteristic values of $H$ have but finite multiplicity
the problem of bringing $W'$, that part of $W$ in which $\mathfrak{R}'$ intersects
itself, into diagonal form deals only with the space $\mathfrak{R}'$ of a finite
number of dimensions. If $\mathfrak{R}'$ is not simply a one-dimensional
space, the resonance phenomena mentioned above will appear.
The co-ordinate system, consisting of characteristic vectors of
$H$, is now more precisely specified, for now $W_0$ also appears as
a diagonal matrix; let $E^0_n$ be the characteristic values of the
$H^0 = H + \varepsilon W_0$ so obtained. The single term value $E'$ associated
with $\mathfrak{R}'$ has in general been resolved into as many different
characteristic values $E^0_n$ of $H^0$ as there are dimensions in the
sub-space $\mathfrak{R}'$.

The remainder $V = \|v_{mn}\|$ of the matrix is such that $v_{mn} = 0$
if the characteristic values $E_m$, $E_n$ of $H$ are equal. The
infinitesimal unitary rotation

$$\delta\mathfrak{x} = \varepsilon \cdot C\mathfrak{x}, \qquad C = \|c_{mn}\|,$$

of order $\varepsilon$ transforms $H^0$ into $H^0 + \delta H^0$ where

$$\delta H^0 = \varepsilon(H^0 C - CH^0) \sim \varepsilon(HC - CH).$$

On choosing this transformation in such a way that $\delta H^0 = -\varepsilon V$,
$\mathbf{H} = H^0 + \varepsilon V$ goes over into $H^0$; this can be accomplished by
choosing $c_{mn} = 0$ if $E_m = E_n$ and

$$c_{mn} = -\frac{v_{mn}}{E_m - E_n}$$

otherwise. The characteristic values $E^0_n$ of $H^0$ are therefore the
energy levels of the perturbed system of energy $\mathbf{H}$ if we neglect terms
of order $\varepsilon^2$.

can be considered as the time mean of the perturbation 
terms, averaged over the motion of the unperturbed system. 
For by (8.7) the mean value of the element of the matrix 

A{t)^ which represents an arbitrary physical quantity of the 
system, is or 0, according as == or not. In statistics 
angular brackets are often used to denote the mean value of 
a quantity ; we may therefore write 
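The claim that the characteristic values of $H + \varepsilon W_0$ reproduce the perturbed levels up to terms of order $\varepsilon^2$ can be checked numerically. The following is only a sketch under stated assumptions: the 3 × 3 matrices are illustrative values not taken from the text, and the helper names `split_perturbation` and `level_error` are hypothetical.

```python
import numpy as np

def split_perturbation(E, W, tol=1e-12):
    """Split W into W0 (entries joining equal unperturbed levels) and V (the rest)."""
    same_level = np.abs(E[:, None] - E[None, :]) < tol
    W0 = np.where(same_level, W, 0.0)
    return W0, W - W0

# Illustrative data (assumed): one doubly degenerate level and one simple level.
E = np.array([1.0, 1.0, 3.0])
H = np.diag(E)
W = np.array([[0.0, 1.0, 0.5],
              [1.0, 0.0, 0.3],
              [0.5, 0.3, 0.2]])
W0, V = split_perturbation(E, W)

def level_error(eps):
    """Largest deviation between the levels of H + eps*W and of H0 = H + eps*W0."""
    exact = np.linalg.eigvalsh(H + eps * W)
    approx = np.linalg.eigvalsh(H + eps * W0)
    return np.max(np.abs(exact - approx))

err_coarse = level_error(1e-2)
err_fine = level_error(1e-3)
print(err_coarse, err_fine)
```

Reducing $\varepsilon$ by a factor of 10 should reduce the deviation by roughly a factor of 100, which is the numerical signature of an error of second order.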

The solution of II naturally provides an answer to the
question I. But it is more convenient to employ the method of
variation of constants for the calculation of the effect of the
perturbation over a limited time interval ; the smaller the
constant $\varepsilon$, the longer we may take this time interval to be.
Assume that at time $t = 0$ the system is in the quantum state
0 and that the perturbation begins to act at this time ; we ask
for the probability that the system will be found in the state
$n$ at time $t$. That is, we seek that solution of the equations

$$\frac{1}{i}\frac{dx_n}{dt} + \nu_n x_n + \frac{\varepsilon}{h}\sum_m w_{nm}\, x_m = 0 \qquad [n = 0, 1, 2, \cdots]$$

which reduces to

$$x_0 = 1, \qquad x_n = 0 \quad (n \neq 0)$$

at time $t = 0$. Writing

$$x_n = b_n \cdot e(-\nu_n t),$$

the equations for $b_n$ are

$$\frac{db_n}{dt} = \frac{\varepsilon}{ih}\sum_m w_{nm}\, e(\nu_{nm} t)\, b_m, \qquad \nu_{nm} = \nu_n - \nu_m\ ;$$

for $\varepsilon = 0$, $\dfrac{db_n}{dt} = 0$. Neglecting terms of order $\varepsilon^2$, we can take
the initial conditions

$$b_0 = 1, \qquad b_n = 0 \quad (n \neq 0)$$

as the 0th approximation ; on substituting these values in the
equation we obtain as the first approximation

$$b_n = -\frac{\varepsilon}{h}\, w_{n0}\, \frac{e\big((\nu_n - \nu_0)t\big) - 1}{\nu_n - \nu_0} \qquad (n \neq 0).$$

On setting $\nu_0 - \nu_n = \nu$ the desired probability is

$$|b_n|^2 = \frac{2\varepsilon^2}{h^2}\, |w_{n0}|^2\, \frac{1 - \cos(\nu t)}{\nu^2}. \qquad (9.1)$$

It is to be noted that in accordance with this result the probability
of transition from state 0 to state $n$ is determined by $|w_{n0}|^2$. In
the case of “ resonance ” ($\nu_n = \nu_0$) the transition probability
increases at first with the square of $t$ :

$$|b_n|^2 = \frac{\varepsilon^2}{h^2}\, |w_{n0}|^2\, t^2.$$
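As a rough numerical illustration of (9.1), one may compare the first-order transition probability with the exact time development of a small system. This is a sketch only: the three-level energies and the matrix $W$ are assumed example values, $h$ is set to 1, and the function names are hypothetical.

```python
import numpy as np

def exact_probability(E, W, eps, t, start, end, h=1.0):
    """|x_end(t)|^2 for the system started in `start`, from exact diagonalization."""
    vals, vecs = np.linalg.eigh(np.diag(E) + eps * W)
    x0 = np.zeros(len(E), dtype=complex)
    x0[start] = 1.0
    # x(t) = exp(-i H t / h) x(0), evaluated in the characteristic basis of H.
    xt = vecs @ (np.exp(-1j * vals * t / h) * (vecs.conj().T @ x0))
    return abs(xt[end]) ** 2

def first_order_probability(E, W, eps, t, start, end, h=1.0):
    """Right-hand side of (9.1), with nu = (E_start - E_end) / h."""
    nu = (E[start] - E[end]) / h
    return 2 * eps**2 / h**2 * abs(W[end, start]) ** 2 * (1 - np.cos(nu * t)) / nu**2

# Illustrative three-level system (values assumed for the example only).
E = np.array([0.0, 1.0, 2.5])
W = np.array([[0.0, 1.0, 0.5],
              [1.0, 0.0, 0.7],
              [0.5, 0.7, 0.0]])
eps, t = 1e-3, 2.0
p_exact = exact_probability(E, W, eps, t, start=0, end=1)
p_first = first_order_probability(E, W, eps, t, start=0, end=1)
print(p_exact, p_first)
```

For small $\varepsilon$ the two numbers should agree closely; the residual difference comes from the neglected higher-order terms.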


§ 10. The Problem of Several Bodies. Product Space 

A physical system consisting of two particles of masses $m$, $m'$,
co-ordinates $xyz$ ; $x'y'z'$ and linear momenta $\mathfrak{p}$, $\mathfrak{p}'$, has as
its Hamiltonian function

$$H = \frac{1}{2m}(p_x^2 + p_y^2 + p_z^2) + \frac{1}{2m'}(p_x'^2 + p_y'^2 + p_z'^2) + V(xyz\ ;\ x'y'z'), \qquad (10.1)$$
where $V$ is the potential energy. We assume, as in the older
physics of central forces, that we are here dealing with an action
at a distance so that the potential energy depends only on the
simultaneous positions of the two particles. This assumption
naturally breaks down when, in accordance with the theory of
relativity, we take into account the finite velocity of propagation
of the disturbance, which requires the introduction of a
field. The wave function $\psi$ of the system will depend on all
the co-ordinates $xyz$ ; $x'y'z'$ in addition to $t$ ; the operators
corresponding to these functions in the domain of such functions
$\psi$ are multiplication by $x, \cdots$ ; $x', \cdots$, and to the linear
momenta correspond the derivatives $\dfrac{h}{i}\dfrac{\partial}{\partial x}, \cdots$ ; $\dfrac{h}{i}\dfrac{\partial}{\partial x'}, \cdots$.
From (10.1) we then obtain the wave equation

$$\frac{h}{i}\frac{\partial\psi}{\partial t} - \frac{h^2}{2m}\Delta\psi - \frac{h^2}{2m'}\Delta'\psi + V\psi = 0, \qquad (10.2)$$

where $\Delta$ and $\Delta'$ denote the Laplace operators with respect to
$xyz$ and $x'y'z'$.
We must ask for the probability that the one particle is to be found
at a point $P$ and, simultaneously, the other is to be found at a
point $P'$. The probability density is accordingly to be computed
in a 6-dimensional space with co-ordinates $x y z$ ; $x' y' z'$.
Indeed, the wave field is not to represent directly occurrences
taking place in physical space, but is to determine the appearance
at definite positions or with definite energies and momenta ;
there is consequently nothing absurd in the fact that its medium
is this abstract 6-dimensional configuration space.

In order to be independent of the special procedure by which
the scalar wave mechanics puts together two systems $\mathfrak{a}$, $\mathfrak{b}$ to
form a single system $\mathfrak{c}$, as suggested by this example involving
the Hermitian forms representing the co-ordinates and momenta
of the two systems, we must first discuss the multiplication of
spaces from a purely mathematical standpoint.

With each vector $\mathfrak{x} = (x_i)$ in a space $\mathfrak{R}$ of $m$ dimensions and
each vector $\mathfrak{y} = (y_k)$ in a space $\mathfrak{S}$ of $n$ dimensions there is
associated a vector $\mathfrak{z} = \mathfrak{x} \times \mathfrak{y}$ with components

$$z_{ik} = x_i y_k \qquad (10.3)$$

in an $m \cdot n$-dimensional space $\mathfrak{T} = \mathfrak{R} \times \mathfrak{S}$, the product space.
The components are here numbered by means of the index
pair $(ik) = l$. The totality of vectors $\mathfrak{z} = \mathfrak{x} \times \mathfrak{y}$ do not themselves
constitute a linear manifold, but their linear combinations
fill the entire product space $\mathfrak{T}$. With the linear correspondences
$A$ in $\mathfrak{R}$ and $B$ in $\mathfrak{S}$ :

$$x'_{i'} = \sum_i a_{i'i}\, x_i, \qquad y'_{k'} = \sum_k b_{k'k}\, y_k$$

is associated a linear correspondence $C = A \times B$ in $\mathfrak{T}$ :

$$x'_{i'}\, y'_{k'} = \sum_{i,k} a_{i'i}\, b_{k'k}\, x_i y_k,$$

or

$$z'_{l'} = \sum_l c_{l'l}\, z_l, \qquad c_{l'l} = a_{i'i}\, b_{k'k} \qquad [l = (ik),\ l' = (i'k')].$$

I 

Naturally, to this multiplication corresponds the law of composition

$$(A \times B)(A_1 \times B_1) = (AA_1 \times BB_1),$$

where $A$, $A_1$ are correspondences of $\mathfrak{R}$ on itself and $B$, $B_1$ are
correspondences in $\mathfrak{S}$. A co-ordinate system in $\mathfrak{R}$ and one in
$\mathfrak{S}$ together determine a co-ordinate system in $\mathfrak{T}$ ; if the
co-ordinate system in $\mathfrak{R}$ is subjected to the transformation $A$ and
that in $\mathfrak{S}$ to the transformation $B$, then the co-ordinate system
associated with them in $\mathfrak{T}$ undergoes the transformation $A \times B$.
In accordance with the equation

$$d(x_i y_k) = dx_i \cdot y_k + x_i \cdot dy_k,$$

to the infinitesimal correspondence $A$ in $\mathfrak{R}$, $B$ in $\mathfrak{S}$ corresponds
the infinitesimal correspondence

$$(A \times 1_s) + (1_r \times B) \qquad (10.4)$$

in $\mathfrak{T}$, where $1_r$, $1_s$ denote the unit matrices in $\mathfrak{R}$, $\mathfrak{S}$, respectively.
All of the foregoing is applicable to arbitrary vector spaces.
When $\mathfrak{R}$ and $\mathfrak{S}$ are both unitary spaces, then $\mathfrak{T}$ is also, for by
(10.3)

$$\sum_{i,k} \bar z_{ik}\, z_{ik} = \sum_i \bar x_i x_i \cdot \sum_k \bar y_k y_k$$

is an invariant if $\sum_i \bar x_i x_i$, $\sum_k \bar y_k y_k$ are ; $A \times B$ is unitary if $A$ and
$B$ are.
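In matrix terms the product $A \times B$ is what numerical libraries call the Kronecker product, so the rules above can be spot-checked on random matrices. The snippet below is a minimal sketch; the dimensions, the random data, and the flat numbering $l = i \cdot n + k$ of the index pair $(ik)$ are assumptions made for the illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
m, n = 2, 3
A, A1 = rng.normal(size=(m, m)), rng.normal(size=(m, m))
B, B1 = rng.normal(size=(n, n)), rng.normal(size=(n, n))
x, y = rng.normal(size=m), rng.normal(size=n)

# z_{ik} = x_i y_k, stored flat with the index pair (ik) numbered as l = i*n + k.
z = np.kron(x, y)

# C = A x B maps x x y to (Ax) x (By).
image_ok = np.allclose(np.kron(A, B) @ z, np.kron(A @ x, B @ y))

# Law of composition: (A x B)(A1 x B1) = (A A1) x (B B1).
composition_ok = np.allclose(np.kron(A, B) @ np.kron(A1, B1),
                             np.kron(A @ A1, B @ B1))

# (10.4): the first-order part of (1 + t A) x (1 + t B) is (A x 1_s) + (1_r x B).
t = 1e-6
generator = np.kron(A, np.eye(n)) + np.kron(np.eye(m), B)
finite = (np.kron(np.eye(m) + t * A, np.eye(n) + t * B) - np.eye(m * n)) / t
infinitesimal_ok = np.allclose(finite, generator, atol=1e-4)
print(image_ok, composition_ok, infinitesimal_ok)
```

The last check approximates the infinitesimal correspondence by a finite difference, so it holds only up to a remainder of order `t`.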

Accordingly, two physical systems $\mathfrak{a}$ and $\mathfrak{b}$ are compounded
to form a total system $\mathfrak{c}$ as follows. The system space of $\mathfrak{c}$
is $\mathfrak{R} \times \mathfrak{S}$, where $\mathfrak{R}$ is the system space of $\mathfrak{a}$ and $\mathfrak{S}$ of $\mathfrak{b}$. Let
the arbitrary physical quantity $\alpha$ in $\mathfrak{R}$ be represented by
the Hermitian form $A$ ; on replacing all these forms $A$ by
$A \times 1_s$, where $1_s$ is the unit form in an arbitrary space $\mathfrak{S}$, there
exist between these latter exactly the same relations as between
the $A$, so that from the solution of a quantum problem in $\mathfrak{R}$
there arises a solution for the corresponding problem in $\mathfrak{R} \times \mathfrak{S}$,
but there exists no real distinction between the two. In the
system $\mathfrak{c}$ obtained by composition we have therefore to associate
the Hermitian form $A \times 1_s$ with a quantity $\alpha$ of $\mathfrak{a}$ and
$1_r \times B$ with $\beta$ of $\mathfrak{b}$, where $A$, $B$ are the forms associated with
$\alpha$, $\beta$ in $\mathfrak{R}$, $\mathfrak{S}$, respectively. The totality of quantities of the
composite system $\mathfrak{c}$ is obtained by starting from the quantities
belonging to the component systems $\mathfrak{a}$ and $\mathfrak{b}$ and multiplying
and adding them together in all possible ways. The quantities
$\alpha$ of $\mathfrak{a}$ commute with the quantities $\beta$ of $\mathfrak{b}$, for

$$(A \times 1_s)(1_r \times B) = A \times B = (1_r \times B)(A \times 1_s).$$

We refer to the content of these last two sentences when we
say that $\mathfrak{c}$ consists of two kinematically independent parts $\mathfrak{a}$ and $\mathfrak{b}$.

The two systems are dynamically independent if the energy
$H$ of the composite system is the sum of the energies $H^{(1)}$, $H^{(2)}$
of the partial systems :

$$H = (H^{(1)} \times 1) + (1 \times H^{(2)}).$$

The infinitesimal unitary correspondence $\dfrac{dt}{ih} \cdot H$ of the total
system space is then that one which is due to the infinitesimal
unitary correspondences $\dfrac{dt}{ih} \cdot H^{(1)}$, $\dfrac{dt}{ih} \cdot H^{(2)}$ in the two original
system spaces [(10.4)]. If $H^{(1)}$ and $H^{(2)}$ are both in diagonal
form, then $H$ is also, and the characteristic numbers are given
by

$$E_l = E_i^{(1)} + E_k^{(2)} \quad \text{or} \quad \nu_l = \nu_i^{(1)} + \nu_k^{(2)} \qquad [l = (ik)].$$

If we have a pure state for the total system which is represented
by the vector $c$ of absolute value 1 and components
$c_{ik}$, and if $Q = \|q_{ii'}\|$ is an arbitrary quantity in $\mathfrak{a}$, then the
expectation of $Q$ in the pure state $c$ is

$$\langle Q \rangle = \sum_{i,i',k} q_{ii'}\, \bar c_{ik}\, c_{i'k}.$$

This has the form (7.2) with $a_{ii'} = \sum_k c_{ik}\, \bar c_{i'k}$ ;
$A(\mathfrak{x})$ is the Hermitian form

$$\sum_k \Big| \sum_i \bar c_{ik}\, x_i \Big|^2$$

in $\mathfrak{R}$. But we see from this that we are not dealing with a pure
state in $\mathfrak{a}$, for $a_{ii'}$ will not in general have the form $a_i \bar a_{i'}$.
Conditions which insure a maximum of homogeneity within $\mathfrak{c}$ need
not require a maximum in this respect within the partial system $\mathfrak{a}$.
Furthermore : if the state of $\mathfrak{a}$ and the state of $\mathfrak{b}$ are known, the
state of $\mathfrak{c}$ is in general not uniquely specified, for a positive definite
Hermitian form $\|a_{ik,\,i'k'}\|$ in the product space, which describes
a statistical aggregate of states $\mathfrak{c}$, is not uniquely determined by
the Hermitian forms

$$\Big\| \sum_k a_{ik,\,i'k} \Big\|, \qquad \Big\| \sum_i a_{ik,\,ik'} \Big\|$$

to which it gives rise in the spaces $\mathfrak{R}$, $\mathfrak{S}$. In this significant
sense quantum theory subscribes to the view that “ the whole
is greater than the sum of its parts,” which has recently been
raised to the status of a philosophical creed by the Vitalists
and the Gestalt Psychologists.
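Both statements of the preceding paragraph can be illustrated on the smallest possible case, two partial systems of two dimensions each. The sketch below is not from the text: the particular state $c_{ik} = \delta_{ik}/\sqrt{2}$, the use of the trace of the squared form as a measure of homogeneity, and the helper names are assumptions chosen for the example.

```python
import numpy as np

def partial_states(rho, m, n):
    """Forms induced in R and S by a form rho = ||a_{ik,i'k'}|| on the product space."""
    r = rho.reshape(m, n, m, n)              # r[i, k, i', k']
    in_R = np.einsum('ikjk->ij', r)          # sum over k of a_{ik,i'k}
    in_S = np.einsum('ikil->kl', r)          # sum over i of a_{ik,ik'}
    return in_R, in_S

def purity(a):
    """Trace of a^2: equals 1 for a pure state and is smaller for a mixture."""
    return np.trace(a @ a).real

# Pure state of the whole with components c_{ik} = delta_{ik} / sqrt(2) (assumed example).
c = np.eye(2, dtype=complex) / np.sqrt(2)
rho_pure = np.outer(c.ravel(), c.ravel().conj())

# A different state of the whole: the uniform statistical aggregate.
rho_mixed = np.eye(4, dtype=complex) / 4

a_pure, b_pure = partial_states(rho_pure, 2, 2)
a_mixed, b_mixed = partial_states(rho_mixed, 2, 2)
print(purity(rho_pure), purity(a_pure))
```

The total state is pure while its part is not, and the two distinct total states `rho_pure` and `rho_mixed` give rise to identical forms in both partial spaces.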

The kinematically independent parts into which a system 
can be resolved need not be spatially separated, nor need they 
even refer to different particles. We can, for example, resolve 
a single particle, whose physical quantities can all be expressed 
in terms of $x, y, z$ ; $p_x, p_y, p_z$, into three partial systems with
fundamental quantities $x, p_x$ ; $y, p_y$ ; $z, p_z$. For quantities
which belong to different partial systems, for example a quantity
which can be expressed in terms of $x, p_x$ alone and one which
is in terms of $y, p_y$ alone, commute with each other in the sense
of matrix multiplication. 

In the perturbation theory we are usually concerned with a
system which consists of two kinematically independent parts
which are almost dynamically independent. Disregarding
the interaction $\varepsilon W$ for the moment, let $h\nu_n$ and $h\rho_r$ be the energy
levels of the two parts, so that $h(\nu_n + \rho_r)$ are the energy levels
of the unperturbed total system. On writing in equation (9.1)
$s = (n, r)$ in place of 0 and $s' = (n', r')$ in place of $n$, whence

$$\nu = (\nu_n + \rho_r) - (\nu_{n'} + \rho_{r'}) = \nu_{nn'} + \rho_{rr'},$$

$$\nu_{nn'} = \nu_n - \nu_{n'}, \qquad \rho_{rr'} = \rho_r - \rho_{r'},$$

we find as the probability that the total system goes over from
the state $s$ to the state $s'$ during time $t$ :

$$\frac{2\varepsilon^2}{h^2}\, \frac{1 - \cos(\nu_{nn'} + \rho_{rr'})t}{(\nu_{nn'} + \rho_{rr'})^2}\, |W(nr,\, n'r')|^2. \qquad (10.5)$$

The probability that the first system will be found in the state
$n'$ after time $t$, the total system having been in the state $s = (nr)$
originally, is obtained from (10.5) by summation with respect
to $r'$.


§ 11. Commutation Rules. Canonical Transformations 

The development of wave mechanics in §§ 1-3 went beyond 
the general scheme of §§ 7 and 8 in that it employed certain 
specific Hermitian forms to represent the co-ordinates and 
momenta of the particle. We are now interested in seeing how 
this can be formulated in an invariant manner, without recourse 
to any special co-ordinate system in system space. 

For the Hermitian forms $q$, $p$ representing a rectangular
co-ordinate and its associated momentum we postulate the
commutation rule

$$pq - qp = \frac{h}{i}\, 1. \qquad (11.1)$$

If the system has only one degree of freedom, these two quantities
appear as canonical variables in classical mechanics. All physical
quantities of the system are then functions of $p$ and $q$ ; in order
to avoid complications we restrict ourselves to polynomials $f$ in
$p$ and $q$, and assume, in particular, that the Hamiltonian function
$H$ has this form. What are we to understand by the derivatives
$f_p$ and $f_q$ of $f$ with respect to $p$ and $q$ in this domain in which
$p$ and $q$ are not commutative in multiplication ? We should
in any case require that differentiation with respect to $q$ should
obey the following postulates :

(1) $p_q = 0$, $q_q = 1$ ;

(2) $(f + g)_q = f_q + g_q$ and $(af)_q = a \cdot f_q$ where $a$ is a number ;

(3) $(fg)_q = f_q \cdot g + f \cdot g_q$.

We see immediately that these conditions uniquely determine
the derivative of a polynomial $f$, unless they happen to lead to
contradictions. But that they do not lead to contradictions
can be seen from the fact that they are obeyed by the definition

$$ih \cdot f_q = fp - pf. \qquad (11.2)$$

(1) follows immediately from the commutation rule (11.1), and
the linearity (2) of the process is evident. (3) is proved by the
formula

$$(fg)p - p(fg) = f(gp - pg) + (fp - pf)g$$

which involves only the distributive and associative character
of matrix multiplication. Similarly we can show that

$$-ih \cdot f_p = fq - qf. \qquad (11.2)$$

The fundamental dynamical law gives us the equation (8.6) :

$$\frac{df}{dt} = \frac{i}{h}(Hf - fH)$$

for any Hermitian form $f$. On applying this equation to $p$ and $q$
(which obviously suffices to establish the corresponding result
for any polynomial $f$ of $p$ and $q$) and comparing it with the
formulae (11.2) applied to the particular function $H$, we are led
to the familiar Hamiltonian equations of classical mechanics :

$$\frac{dq}{dt} = \frac{\partial H}{\partial p}, \qquad \frac{dp}{dt} = -\frac{\partial H}{\partial q}. \qquad (11.3)$$
It is a universal trait of quantum theory to retain all the relations
of classical physics ; but whereas the latter interpreted these
relations as conditions to which the values of physical quantities were
subject in all individual cases, the former interprets them as
conditions on the quantities themselves, or rather on the Hermitian
matrices which represent them. This is the more significant
formulation which the new quantum theory has given Bohr’s
correspondence principle.

The commutation rule (11.1) is of a rather remarkable
nature. It is entirely impossible for matrices in a space of a
finite number of dimensions, and it alone precludes the possibility
that in an $\infty$-dimensional space $q$ (or $p$) have only a discrete
spectrum of characteristic numbers. For on referring $q$ to its
principal axes

$$q = \|q_{mn}\|, \quad q_{nn} = q_n, \quad q_{mn} = 0 \ (m \neq n)\ ; \qquad p = \|p_{mn}\|,$$

the left side of the commutation rule has the components
$p_{mn}(q_n - q_m)$ ; hence the main diagonal consists of nothing
but zeros ! The question arises as to whether it can be concluded
from (11.1) alone that the forms representing $q$ and $p$
can always be given the form

$$\int_{-\infty}^{+\infty} x\, \bar\psi(x)\, \psi(x)\, dx, \qquad \int_{-\infty}^{+\infty} \bar\psi(x)\, \frac{h}{i}\frac{d\psi}{dx}\, dx$$

for an arbitrary vector with components $\psi(x)$ on employing
an appropriate co-ordinate system in system space. We shall
see in Chap. IV, § 15, that, on introducing a certain irreducibility
condition, this is in fact the case.
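The principal-axes argument can be reproduced numerically for any pair of finite Hermitian matrices. The sketch below uses random 5 × 5 matrices as stand-ins for $p$ and $q$ (an assumption made purely for illustration) and confirms that, once $q$ is diagonal, $pq - qp$ has the components $p_{mn}(q_n - q_m)$ and therefore a vanishing main diagonal, so that it cannot equal a non-zero multiple of the unit matrix.

```python
import numpy as np

rng = np.random.default_rng(1)

def random_hermitian(n):
    """A random n x n Hermitian matrix (illustrative stand-in for p or q)."""
    a = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return (a + a.conj().T) / 2

n = 5
p, q = random_hermitian(n), random_hermitian(n)

# Refer q to its principal axes: U^* q U = diag(q_1, ..., q_n).
q_vals, U = np.linalg.eigh(q)
p_axes = U.conj().T @ p @ U
q_axes = np.diag(q_vals)

commutator = p_axes @ q_axes - q_axes @ p_axes
# Components p_mn (q_n - q_m), as in the text.
expected = p_axes * (q_vals[None, :] - q_vals[:, None])
diagonal = np.diag(commutator)
print(np.max(np.abs(diagonal)))
```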

On taking into account the three space co-ordinates $q_\alpha$ and
their associated linear momenta $p_\alpha$ ($\alpha = 1, 2, 3$), we have in
place of the one commutation rule (11.1) the following :

$$p_\alpha p_\beta - p_\beta p_\alpha = 0, \qquad q_\alpha q_\beta - q_\beta q_\alpha = 0 \qquad \text{for all } \alpha, \beta\ ;$$

$$p_\alpha q_\beta - q_\beta p_\alpha = \begin{cases} \dfrac{h}{i}\, 1 & (\alpha = \beta), \\ 0 & (\alpha \neq \beta). \end{cases} \qquad (11.4)$$

The same commutation rules apply to the case in which we have
several particles, the only difference being that then $\alpha$ runs
through 6, 9, $\cdots$ values, according to the number of particles,
instead of 3. These commutation rules are the necessary and
sufficient condition that the dynamical law, which governs the
time rate of change of the state vector $\mathfrak{x}$ in system space, leads
to the Hamiltonian equations for the “ canonical variables ”
$q_\alpha$, $p_\alpha$ representing the co-ordinates and associated momenta of
the various particles composing the physical system, whatever
the dependence of the Hamiltonian function $H$ on these quantities
may be.

In classical mechanics the Hamiltonian equations are invariant
with respect to canonical transformations. In a system of
$f$ degrees of freedom the transition from a set of variables $q_\alpha$, $p_\alpha$
describing the state to a set $q'_\alpha$, $p'_\alpha$ ($\alpha = 1, 2, \cdots, f$) is a
canonical transformation if the difference

$$\sum_\alpha p'_\alpha\, dq'_\alpha - \sum_\alpha p_\alpha\, dq_\alpha \qquad (11.5)$$

is a total differential. If, for example, the $q_\alpha$ are subjected to
a transformation

$$q_\alpha = f_\alpha(q'_1 \cdots q'_f)$$

among themselves, the $p_\alpha$ must transform as the components
of a “ covariant vector ” in $q$-space in order that the whole be
a canonical transformation (“ extended point transformation ”) :

$$p'_\alpha = \sum_\beta \frac{\partial f_\beta}{\partial q'_\alpha} \cdot p_\beta.$$

Perhaps the simplest canonical transformation is that in which
the roles of $q$ and $p$ are interchanged :

$$p'_\alpha = q_\alpha, \qquad q'_\alpha = -p_\alpha.$$

The canonical transformations constitute a group [cf. III, § 1]. For
the identity, i.e. the transition from $(p, q)$ to $(p, q)$, is a canonical
transformation ; the inverse $(p', q') \to (p, q)$ of a canonical
transformation $(p, q) \to (p', q')$ is also canonical ; and from the
canonical transformations $(p, q) \to (p', q')$, $(p', q') \to (p'', q'')$
it follows that the resultant transformation $(p, q) \to (p'', q'')$
is also canonical, for if

$$\sum_\alpha p'_\alpha\, dq'_\alpha - \sum_\alpha p_\alpha\, dq_\alpha, \qquad \sum_\alpha p''_\alpha\, dq''_\alpha - \sum_\alpha p'_\alpha\, dq'_\alpha$$

are total differentials their sum

$$\sum_\alpha p''_\alpha\, dq''_\alpha - \sum_\alpha p_\alpha\, dq_\alpha$$

is also.

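An infinitesimal canonical transformation is one in which
$p'$, $q'$ differ infinitely little from $p$, $q$. We can consider it as
an infinitesimal deformation of the $2f$-dimensional $(p, q)$-space
which takes place in the infinitesimal time interval $\varepsilon = \delta t$. We
introduce the components $\delta p_\alpha$, $\delta q_\alpha$ of the displacement vector by
means of the equations

$$p'_\alpha = p_\alpha + \varepsilon \cdot \delta p_\alpha, \qquad q'_\alpha = q_\alpha + \varepsilon \cdot \delta q_\alpha.$$

Since (11.5) must be a total differential,

$$\sum_\alpha p'_\alpha\, dq'_\alpha + \sum_\alpha q_\alpha\, dp_\alpha = dT \qquad (11.6)$$

must also ; in our case $T$ must differ only infinitesimally from
$\sum_\alpha p_\alpha q'_\alpha$. We may therefore write

$$T = \sum_\alpha p_\alpha q'_\alpha - \varepsilon S\ ;$$

considering $S$ as a function of $p_\alpha$ and $q'_\alpha$ we have, in accordance
with (11.6),

$$p'_\alpha = p_\alpha - \varepsilon\frac{\partial S}{\partial q'_\alpha}, \qquad q_\alpha = q'_\alpha - \varepsilon\frac{\partial S}{\partial p_\alpha}$$

or

$$\delta p_\alpha = -\frac{\partial S}{\partial q'_\alpha}, \qquad \delta q_\alpha = \frac{\partial S}{\partial p_\alpha}. \qquad (11.7)$$

Since we may legitimately neglect terms of order $\varepsilon^2$, we may
identify $q'_\alpha$ with $q_\alpha$ on the right-hand side of these equations.
We call $S$ the generating function of the infinitesimal canonical
transformation.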

In accordance with the Hamiltonian equations, the state
of a system, represented by a point $(p, q)$ in $(p, q)$-space, goes
over into a state $(p + dp, q + dq)$ during time $dt$. If we follow
this transition for all possible initial states $(p, q)$ we obtain an
infinitesimal deformation of the space whose points represent
the state of the system. The Hamiltonian equations assert that
this deformation is an infinitesimal canonical transformation with
generating function $H \cdot dt$. It follows from this without any
calculation that these equations have a significance which is 
independent of any particular choice of canonical variables. 

Now in quantum theory the Hamiltonian equations (11.3)
assert that the state vector $\mathfrak{x}$ in system space undergoes the
infinitesimal unitary rotation

$$d\mathfrak{x} = \frac{dt}{ih} \cdot H\mathfrak{x}\ ; \qquad (8.1)$$

so the infinitesimal canonical transformation of the quantities
$p$, $q$ is here obtained by subjecting the argument $\mathfrak{x}$ in the Hermitian
forms representing them to the infinitesimal rotation

$$\varepsilon \cdot \delta\mathfrak{x} = \frac{\varepsilon}{ih} \cdot S\mathfrak{x}.$$

We find that the increments of the quantities $p_\alpha$, $q_\alpha$ are in fact

$$\varepsilon \cdot \delta q_\alpha = \frac{i\varepsilon}{h}(Sq_\alpha - q_\alpha S), \qquad \varepsilon \cdot \delta p_\alpha = \frac{i\varepsilon}{h}(Sp_\alpha - p_\alpha S),$$

and, in virtue of the commutation relations (11.4), this agrees
exactly with (11.7). On generating a finite canonical transformation
by the successive application of an infinity of infinitesimal
ones we arrive at the result that the unitary correspondences
of system space on itself in quantum theory :

$$\mathfrak{x}' = U\mathfrak{x}$$

correspond to the canonical transformations of classical mechanics ;
more precisely, only those for which the matrix $U$ is expressible
in terms of the matrices $p$, $q$, but we may for the present pass
over the question as to whether every matrix $U$ can be obtained,
or at least arbitrarily closely approximated, in this way. Since
the commutation rules (11.4) remain unchanged under rotations
of the normal co-ordinate system, they are valid for an arbitrary
set of canonical variables. This is also evident from the fact
that they are the conditions that the dynamical law (8.1) lead
to the Hamiltonian equations

$$\frac{dq_\alpha}{dt} = \frac{\partial H}{\partial p_\alpha}, \qquad \frac{dp_\alpha}{dt} = -\frac{\partial H}{\partial q_\alpha}. \qquad (11.3)$$

The general procedure for the quantum mechanical treat- 
ment of a physical system suffers from the disagreeable fact 
that the expression for the energy in terms of the canonical 
variables must be taken from the classical model, and in ad- 
dition the transition to quantum mechanics is even then not 
unique, for the model offers no means of telling whether a 
monomial such as $p^2q$ is to be interpreted as $p^2q$, $pqp$, $qp^2$ or
a linear combination of all three [cf. IV, § 14]. The provisional
character of such a procedure is clear, but the results so far obtained
seem to justify the hope that the path we have entered
upon will lead to a unique formulation of the laws governing
the actual physical phenomena. We need then no longer concern
ourselves with the general mechanical scheme.

§ 12. Motion of a Particle in an Electro-magnetic 
Field. Zeeman Effect and Stark Effect 

Let the spatial co-ordinates $xyz$ now be denoted by $x_1 x_2 x_3$
and the time $t$ by $x_0$. If $\phi$ is the scalar and $c\mathfrak{A}$ the vector potential
of the electro-magnetic field, then in the theory of relativity

$$(-\phi,\ \mathfrak{A}_x,\ \mathfrak{A}_y,\ \mathfrak{A}_z) = (\phi_0, \phi_1, \phi_2, \phi_3)$$

are the components of a vector in the space dual to the 4-dimensional
world. Let

$$F_{\alpha\beta} = \frac{\partial\phi_\beta}{\partial x_\alpha} - \frac{\partial\phi_\alpha}{\partial x_\beta}\ ;$$

$(F_{10}, F_{20}, F_{30})$ are the components of the electric field strength $\mathfrak{E}$,
$c(F_{23}, F_{31}, F_{12})$ the components of the magnetic field strength $\mathfrak{H}$.
Denoting the components of the velocity of a particle by
$v_1, v_2, v_3$, its proper time is

$$ds = \sqrt{dt^2 - (dx_1^2 + dx_2^2 + dx_3^2)/c^2} = dt\sqrt{1 - v^2/c^2} \qquad (v^2 = v_1^2 + v_2^2 + v_3^2).$$

With the world vector $u^\alpha = \dfrac{dx_\alpha}{ds}$ is associated the dual with
components

$$u_i = u^i \quad (i = 1, 2, 3), \qquad u_0 = -c^2 u^0.$$

The invariant equations of motion for a particle of mass $m$ and
charge $-e$ are

$$\frac{d(mu_\alpha)}{ds} = -e \sum_{\beta = 0}^{3} F_{\alpha\beta}\, u^\beta$$

or

$$\frac{d(mu_i)}{dt} = -e\Big(F_{i0} + \sum_{k=1}^{3} F_{ik}\, v_k\Big) \qquad (i = 1, 2, 3). \qquad (12.1)$$

The right-hand side is in fact the ponderomotive force

$$-e\Big(\mathfrak{E} + \frac{1}{c}[\mathfrak{v}\,\mathfrak{H}]\Big).$$


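These equations arise from the Hamiltonian function

$$H = e\phi_0 + c\sqrt{m^2c^2 + \sum_{i=1}^{3}(p_i + e\phi_i)^2} \qquad (12.2)$$

in which $x_1 x_2 x_3$ ; $p_1 p_2 p_3$ are the canonical variables. In fact,
the Hamiltonian equations

$$\frac{dx_i}{dt} = \frac{\partial H}{\partial p_i} = \frac{c\,(p_i + e\phi_i)}{\sqrt{m^2c^2 + \sum_k (p_k + e\phi_k)^2}}$$

yield

$$p_i + e\phi_i = mu_i\ ;$$

in the remaining equations

$$\frac{dp_i}{dt} = -\frac{\partial H}{\partial x_i} = -e\Big\{\frac{\partial\phi_0}{\partial x_i} + \sum_{k=1}^{3} \frac{\partial\phi_k}{\partial x_i}\, v_k\Big\}$$

the left-hand side is

$$\frac{d(mu_i)}{dt} - e\Big\{\frac{\partial\phi_i}{\partial x_0} + \sum_{k=1}^{3} \frac{\partial\phi_i}{\partial x_k}\, v_k\Big\}.$$

But this is the desired equation (12.1) :

$$\frac{d(mu_i)}{dt} = -e\Big\{\Big(\frac{\partial\phi_0}{\partial x_i} - \frac{\partial\phi_i}{\partial x_0}\Big) + \sum_{k=1}^{3} \Big(\frac{\partial\phi_k}{\partial x_i} - \frac{\partial\phi_i}{\partial x_k}\Big) v_k\Big\}.$$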
The negative energy $-H$ is the time component $p_0$ of the dual
vector whose space components are the components of linear
momentum $\mathfrak{p} = (p_1, p_2, p_3)$, so the equation (12.2) can be written
in the rational form

$$\frac{1}{c^2}(p_0 + e\phi_0)^2 - \sum_{i=1}^{3}(p_i + e\phi_i)^2 = m^2c^2.$$

From this we obtain the simple rule : The influence of an electro-magnetic
field on a particle of charge $-e$ can be expressed by replacing
$p_\alpha$ by $p_\alpha + e\phi_\alpha$ in the equations of motion for a free particle.
On going over to quantum theory $p_\alpha$ becomes the operator
$\dfrac{h}{i}\dfrac{\partial}{\partial x_\alpha}$ and is contragradient to the 4-dimensional displacement
$dx_\alpha$, as is seen from the equation

$$d\psi = \sum_\alpha \frac{\partial\psi}{\partial x_\alpha}\, dx_\alpha.$$

Our rule is now : On introducing a field of potential $\phi_\alpha$,

$$\frac{\partial}{\partial x_\alpha} \quad \text{must be replaced by} \quad \frac{\partial}{\partial x_\alpha} + \frac{ie}{h}\phi_\alpha \qquad (12.3)$$

in the wave equation of the particle. Only $\bar\psi\psi$ has a simple physical
significance ; it is therefore to be assumed that the laws which
govern $\psi$ remain invariant on replacing $\psi$ by $e^{i\lambda} \cdot \psi$, where $\lambda$ is
any real function of position in space-time. On the other hand,
in the classical theory of the electro-magnetic field only the
field strengths, and not the potentials, have an objective significance,
i.e. the laws are invariant on replacing $\phi_\alpha$ by $\phi_\alpha - \dfrac{\partial\mu}{\partial x_\alpha}$,
where $\mu$ is also an arbitrary function of the $x_\alpha$. On examining
our wave equation for these invariantive properties we find
that it is not invariant under each of them separately, but that
there must exist a certain relation between $\lambda$ and $\mu$. The field
equations for the potentials $\psi$ and $\phi$ of the material and electro-magnetic
waves are invariant under the simultaneous replacement
of

$$\psi \ \text{by}\ e^{i\lambda} \cdot \psi \quad \text{and} \quad \phi_\alpha \ \text{by}\ \phi_\alpha - \frac{h}{e}\frac{\partial\lambda}{\partial x_\alpha}\ ;$$

here $\lambda$ is an arbitrary function of the space-time co-ordinates.
This “ principle of gauge invariance ” is quite analogous to that
previously set up by the author, on speculative grounds, in
order to arrive at a unified theory of gravitation and electricity.
But I now believe that this gauge invariance does not tie together
electricity and gravitation, but rather electricity and
matter in the manner described above. We shall discuss this
principle more thoroughly in Chap. IV ; its significance and
its interpretation will then be more apparent.

On passing to the limit $c \to \infty$ in (12.2), after separating
out the term $mc^2$, we return to ordinary mechanics :

$$H = e\phi_0 + \frac{1}{2m}\sum_i (p_i + e\phi_i)^2.$$

On neglecting terms which are quadratic in the $\phi_i$, we find, in
addition to the kinetic energy $\sum_i p_i^2/2m$, the potential
energy

$$e\phi_0 + \frac{e}{m}\sum_i p_i\phi_i = -e\phi + \frac{e}{m}(\mathfrak{p}\mathfrak{A}). \qquad (12.4)$$

We have already made use of the first part, that due to the
electric field, in § 5. If we have, in addition to the field originating
in the nucleus, a homogeneous electro-static field in the
direction of the $z$-axis and of strength $F$, for which $\phi = -F \cdot z$,
it adds the perturbation term

$$W = eF \cdot z$$

to the energy. A homogeneous static magnetic field $\mathfrak{H}$ is
obtained from the vector potential $c\mathfrak{A} = \tfrac{1}{2}[\mathfrak{H}\mathfrak{r}]$, $\mathfrak{r} = (x, y, z)$ ;
this adds to the energy the perturbation term

$$\frac{e}{2mc}(\mathfrak{p}[\mathfrak{H}\mathfrak{r}]),$$

i.e.

$$\frac{e}{2mc}(\mathfrak{H}[\mathfrak{r}\mathfrak{p}]). \qquad (12.5)$$


Zeeman Effect. If the homogeneous magnetic field strength,
of magnitude $|\mathfrak{H}|$, is in the direction of the $z$-axis, the perturbation
term is

$$W = ho \cdot L_z, \qquad o = \frac{e|\mathfrak{H}|}{2mc}. \qquad (12.6)$$

On choosing the characteristic functions $\psi_{nl}^{(m)}$ as our co-ordinate
system in the system space of the functions $\psi$, $W$, as well as
the energy of the unperturbed atom, is in diagonal form ; in
the state defined by $nl$, $m$ it has the value

$$ho \cdot m. \qquad (12.7)$$

The components $(nl, m) \to (n'l', m')$, consistent with the selection
rule for $m$, into which the line with frequency $\nu = \dfrac{1}{h}(E_{nl} - E_{n'l'})$
is broken up give rise to but three lines : one corresponding to
all the transitions $m \to m$, which is linearly polarized in the
direction of the $z$-axis and is undisplaced ; one which is circularly
polarized perpendicular to the $z$-axis, the frequency $\nu$ of which
is displaced by $+o$ ($m \to m - 1$) ; and one which is circularly
polarized in the opposite sense, with frequency $\nu - o$ instead
of $\nu$ ($m \to m + 1$). This normal Zeeman effect is found only
in the so-called singlet lines.
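That only three lines arise, however many magnetic sub-levels take part, follows from simple counting: the displacement of a component $(nl, m) \to (n'l', m')$ is $o \cdot (m - m')$, and the selection rule admits only $m - m' = 0, \pm 1$. The short sketch below enumerates the allowed components; the function name and the choice $l = 2$, $l' = 1$, $o = 1$ are assumptions made for the example.

```python
def zeeman_components(l, l_prime, o=1.0):
    """Group the transitions allowed by m -> m, m -> m -+ 1 by their displacement o*(m - m')."""
    shifts = {}
    for m in range(-l, l + 1):
        for m_prime in range(-l_prime, l_prime + 1):
            if abs(m - m_prime) <= 1:
                shifts.setdefault(o * (m - m_prime), []).append((m, m_prime))
    return shifts

components = zeeman_components(l=2, l_prime=1)
print(sorted(components))
```

In this example nine transitions are allowed, yet they fall on only three distinct frequencies.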

Stark Effect. In accordance with the general perturbation
theory, the displacement and resolution of terms in the presence
of a homogeneous electric field is determined, to terms of first
order, by the matrix

$$eF \cdot \langle z \rangle.$$

In consequence of the selection rule $l \to l \pm 1$, $\langle z \rangle = 0$, unless
accidentally all energy levels whose azimuthal quantum numbers
differ by 1 coincide. Ignoring this exceptional case, we should
expect to find no 1st order perturbation effect increasing linearly
with the field strength $F$ (linear Stark effect), but only a quadratic
effect, which is much smaller. This is in agreement with the
experimental data on alkali atoms. Hydrogen is, however,
degenerate, since for it energy levels with the same principal
quantum number $n$ and $l = 0, 1, \cdots, n - 1$ coincide. The
calculations for this case have been carried out by Schrödinger
and compared with experiment.

§ 13. Atom in Interaction with Radiation 

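Following Jeans, black body radiation is mathematically
equivalent to a system of infinitely many oscillators. Maxwell’s
equations for the free ether are

$$\operatorname{div} \mathfrak{H} = 0, \qquad \operatorname{curl} \mathfrak{E} + \frac{1}{c}\frac{\partial\mathfrak{H}}{\partial t} = 0\ ;$$

$$\operatorname{div} \mathfrak{E} = 0, \qquad \operatorname{curl} \mathfrak{H} - \frac{1}{c}\frac{\partial\mathfrak{E}}{\partial t} = 0.$$

In order to simplify the relations, we assume that the walls of
the radiation cavity of volume $V$ are reflecting ; then $\mathfrak{E}$ is
perpendicular to the walls at the boundaries of the cavity.
Since the black body is at rest it is of no particular advantage
to carry through the calculation in a relativistically invariant
manner ; we may therefore normalize the vector potential
$c\mathfrak{A}$ in such a way that the scalar potential vanishes. We then
have $\mathfrak{E} = -\dfrac{\partial\mathfrak{A}}{\partial t}$ and the equations in the first row are satisfied
by $\mathfrak{H} = c \cdot \operatorname{curl} \mathfrak{A}$ ; the equations in the second row become

$$\operatorname{div} \mathfrak{A} = 0, \qquad \Delta\mathfrak{A} - \frac{1}{c^2}\frac{\partial^2\mathfrak{A}}{\partial t^2} = 0.$$

On the boundary $\mathfrak{A}$ is normal to the walls. Let the characteristic
numbers and characteristic functions of the equations

$$\Delta\mathfrak{A} + \frac{\rho^2}{c^2}\mathfrak{A} = 0, \qquad \operatorname{div} \mathfrak{A} = 0,$$

with the boundary condition that $\mathfrak{A}$ is there normal, be denoted
by

$$\rho_\alpha, \quad \mathfrak{A}_\alpha \qquad [\alpha = 1, 2, 3, \cdots],$$

normalized in accordance with

$$\int (\mathfrak{A}_\alpha \mathfrak{A}_\beta)\, dV = 4\pi\, \delta_{\alpha\beta}.$$

On setting

$$\mathfrak{A} = \sum_\alpha q_\alpha \mathfrak{A}_\alpha,$$

where the coefficients $q_\alpha$ depend on time but not on position, we
find for them the equations

$$\frac{d^2 q_\alpha}{dt^2} + \rho_\alpha^2\, q_\alpha = 0.$$

Introducing $p_\alpha = \dfrac{dq_\alpha}{dt}$ in addition to the $q_\alpha$ this equation is
that for an oscillator with Hamiltonian function

$$\tfrac{1}{2}(p_\alpha^2 + \rho_\alpha^2 q_\alpha^2)\ ;$$

we readily find on applying

$$\mathfrak{E} = -\sum_\alpha p_\alpha \mathfrak{A}_\alpha, \qquad \mathfrak{H} = c \sum_\alpha q_\alpha \operatorname{curl} \mathfrak{A}_\alpha$$

that the energy of the radiation field is in fact given by

$$\frac{1}{8\pi}\int (\mathfrak{E}^2 + \mathfrak{H}^2)\, dV = \frac{1}{2}\sum_\alpha (p_\alpha^2 + \rho_\alpha^2 q_\alpha^2)\ ;$$

with this we have proved the theorem due to Jeans. For high
frequencies $\rho$ there are approximately

$$\frac{V \rho^2\, d\rho}{\pi^2 c^3} \qquad (13.1)$$

modes of oscillation in the frequency interval $\rho$, $\rho + d\rho$. We
are interested above all in the limiting case of an infinitely large
cavity ; the spectrum then becomes continuous and our formula
for the density of frequencies becomes exact.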

On quantizing this mechanical system of infinitely many
oscillators in accordance with the theory of the oscillator (§ 3)
and the process of composition (§ 10 ; but cf. remark on p. 109),
we find as possible quantum states $s$, each of which is characterized
by the fact that in it there is associated with each index $\alpha$ an
integer $n_\alpha \geq 0$. In this quantum state

$$E_s = \sum_\alpha h\rho_\alpha\,(n_\alpha + \tfrac{1}{2})$$

or, on choosing the additive constant in the energy in such a
way that the lowest energy value which the black body radiation
is capable of assuming is 0,

$$E_s = h\rho_s, \qquad \rho_s = \sum_\alpha n_\alpha\, \rho_\alpha.$$

In the language of photons this means that when the cavity
is in the state $s$ it contains $n_\alpha$ photons of each kind $\alpha$. The
matrix element

$$q^{\alpha}_{ss'} \qquad [s = (n_1, n_2, \cdots)\ ;\ s' = (n'_1, n'_2, \cdots)]$$

of $q_\alpha$ vanishes unless all the equations

$$n'_1 = n_1, \quad n'_2 = n_2, \quad n'_3 = n_3, \quad \cdots$$

hold with the exception of $n'_\alpha = n_\alpha$, which is to be replaced by

$$n'_\alpha = n_\alpha + 1 \quad \text{or} \quad n'_\alpha = n_\alpha - 1.$$

In the first case we have, by eq. (8.12),

$$|q^{\alpha}_{ss'}|^2 = \frac{h}{2\rho_\alpha}(n_\alpha + 1) \qquad \text{(Emission)}, \qquad (13.2)$$

and in the second

$$|q^{\alpha}_{ss'}|^2 = \frac{h}{2\rho_\alpha}\, n_\alpha \qquad \text{(Absorption)}. \qquad (13.2)$$

The first transition $s \to s'$ consists in a photon of kind $\alpha$ springing
into being, the second in the disappearance of one such photon.
It follows from the above that in a transition for which $q^{\alpha}_{ss'} \neq 0$
all other $q^{\beta}_{ss'}$ must vanish.

Let an atom with fixed nucleus and electric dipole moment $\mathfrak{q}$
interact with the radiation field. Differentiate the quantum
states of the atom from one another by means of the index $n$
and denote the corresponding energies by $h\nu_n$ ; then $\mathfrak{q} = \|\mathfrak{q}_{nn'}\|$.
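The selection rule $n'_\alpha = n_\alpha \pm 1$ and the factors $n_\alpha + 1$ and $n_\alpha$ for emission and absorption can be read off from the matrix of $q$ for a single oscillator with Hamiltonian function $\tfrac{1}{2}(p^2 + \rho^2 q^2)$. The following sketch builds that matrix in a truncated basis $n = 0, \cdots, n_{\max}$. It assumes the standard real form of the oscillator matrix (any phase factors in (8.12) are not reproduced, so only squared elements are compared), and the values $\rho = 2$, $h = 1$ are illustrative.

```python
import numpy as np

def oscillator_q(n_max, rho, h=1.0):
    """Matrix of q for H = (p^2 + rho^2 q^2)/2 in the basis n = 0, ..., n_max."""
    ladder = np.diag(np.sqrt(np.arange(1, n_max + 1)), k=1)   # lowers n by one
    return np.sqrt(h / (2 * rho)) * (ladder + ladder.T)

rho, n_max = 2.0, 6
q = oscillator_q(n_max, rho)

n = 3
emission = q[n + 1, n] ** 2      # n -> n + 1 : a photon springs into being
absorption = q[n - 1, n] ** 2    # n -> n - 1 : a photon disappears
print(emission, absorption)
```

With these values the squared elements are $\tfrac{h}{2\rho}(n + 1) = 1$ and $\tfrac{h}{2\rho}\, n = 0.75$, and every element of `q` outside the two neighbouring diagonals vanishes.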
A quantum state of the total system consisting of both atom
and radiation is characterized by the quantum numbers

$$n\ ;\ n_1, n_2, \cdots, n_\alpha, \cdots.$$

The effect of the radiation on the atom is, in accordance with
eq. (12.4) of the preceding paragraph, given to a first approximation
by the perturbation term

eW - (q9l). 

It can be shown that the addition of such a term to the
Hamiltonian function of the total system will, according to
classical theory, not only indicate an influence exerted on the
atom by the radiation field, but will also modify the equations
of Maxwell in a way which indicates that the motion of the
electrons in the atom affects the radiation field. The perturbation
term will accordingly call forth emission as well as
absorption. To a sufficient approximation we may take for
$\mathfrak{A}$ its value at the point occupied by the nucleus, provided we restrict
ourselves to radiation whose wave-length is large compared with
the dimensions of the atom. We now have

elf (13.3) 

a 

From this it follows that an element $\varepsilon \cdot W_{ns,\,n's'}$ can only differ
from 0 if $s$ and $s'$ are such that all $n'_\beta - n_\beta = 0$ with the exception
of a single one $n'_\alpha - n_\alpha$, which must equal $\pm 1$. Then only the
$\alpha$-th term contributes to the sum (13.3), and we have

(13-4) 

Bohr's frequency condition, which asserts that the emission or
absorption of a photon in state $\alpha$ with energy $h\rho_\alpha$ is associated
with a quantum jump of the atom in which an amount
$\pm h(\nu_{n'} - \nu_n) = h\rho_\alpha$ of energy is lost or won, need by no means
be satisfied here. The finite cavity has its own frequencies $\rho_\alpha$,
and may therefore be in no position to take up the frequencies
associated with the quantum jumps of the atom. This is true
in principle, but as a matter of fact, as we shall see, Bohr's
frequency condition is fulfilled to a very close approximation in
the overwhelming majority of all transitions; and this is more
and more the case the larger the cavity is.

Let the atom be in the state n and the radiation in the
state $s = \{n_\alpha\}$. We set

$$\sum h\,n_\alpha\,\rho_\alpha = V \cdot U(\rho)\,d\rho, \tag{13.5}$$

where the sum on the left is to be extended over those indices $\alpha$
for which $\rho_\alpha$ lies between $\rho$ and $\rho + d\rho$; hence $U(\rho)\,d\rho$ is the
energy density of the radiation contained in the frequency
range $\rho$, $\rho + d\rho$. In accordance with (10.5), the probability
that the atom will find itself in the state n' after time t is given
by

$$\frac{2}{h^2}\sum_{s'} |\epsilon W_{ns,\,n's'}|^2\,\frac{1 - \cos(\nu_{nn'} + \rho_{ss'})t}{(\nu_{nn'} + \rho_{ss'})^2}. \tag{13.6}$$


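The contribution to this sum due to the cases in which a photon
is emitted is, in accordance with equations (13.2), (13.4), given
by

$$\frac{2}{h^2}\sum_\alpha \frac{1 - \cos(\nu_{nn'} - \rho_\alpha)t}{(\nu_{nn'} - \rho_\alpha)^2}\cdot\frac{h(n_\alpha + 1)}{2\rho_\alpha}\,|(\mathfrak{q}_{nn'}\mathfrak{A}_\alpha)|^2, \tag{13.6e}$$

and that for absorption by

$$\frac{2}{h^2}\sum_\alpha \frac{1 - \cos(\nu_{nn'} + \rho_\alpha)t}{(\nu_{nn'} + \rho_\alpha)^2}\cdot\frac{h\,n_\alpha}{2\rho_\alpha}\,|(\mathfrak{q}_{nn'}\mathfrak{A}_\alpha)|^2. \tag{13.6a}$$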

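Consider first the case in which the term level $\nu_{n'}$ is higher than
$\nu_n$; $\nu_{nn'} = \nu_n - \nu_{n'} = -\nu$ is then negative. We now collect
together all those terms $\alpha$ in the sum (13.6a) for which $\rho_\alpha$ lies
between $\rho$ and $\rho + d\rho$. Since the position of the atom is not
exactly fixed (even in consequence of the variations caused by
the emission of photons) we may, for small wave-lengths,
replace $\mathfrak{A}_\alpha^2$ by its mean value $4\pi/V$ as given by the normalizing
equation $\int \mathfrak{A}_\alpha^2\,dV = 4\pi$, and we may also assume that all
directions are equally probable for $\mathfrak{A}_\alpha$. The square $|(\mathfrak{A}_\alpha\mathfrak{q})|^2$ of
the scalar product of $\mathfrak{A}_\alpha$ with a fixed vector $\mathfrak{q}$ has then the mean
value $\frac{4\pi}{3V}\cdot|\mathfrak{q}|^2$. (13.6a) then becomes

$$\frac{2}{h^2}\cdot\frac{1 - \cos(\rho - \nu)t}{(\rho - \nu)^2}\cdot\frac{4\pi}{3V}\,|\mathfrak{q}_{nn'}|^2\cdot\frac{\sum h\,n_\alpha\rho_\alpha}{2\rho^2}.$$

On introducing (13.5) the sum (13.6a) may, to a good approximation,
be replaced by the integral

$$\frac{4\pi}{3}\,\frac{|\mathfrak{q}_{nn'}|^2}{h^2}\int\frac{1 - \cos(\rho - \nu)t}{(\rho - \nu)^2}\cdot\frac{U(\rho)\,d\rho}{\rho^2}.$$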

Essentially the only elements which contribute to the value of
this integral, for a time t large in comparison with the duration $1/\nu$
of an oscillation, are those for which $\rho$ lies near to $\nu$. On developing

$$\frac{U(\rho)}{\rho^2} = \frac{U(\nu)}{\nu^2} + \cdots$$

in powers of $\rho - \nu$, the first term in the expansion contributes

$$\frac{U(\nu)}{\nu^2}\int_{-\infty}^{+\infty}\frac{1 - \cos xt}{x^2}\,dx = \pi t\,\frac{U(\nu)}{\nu^2} \tag{13.7}$$



to the integral; all others are to be neglected. Similarly the
entire amount (13.6e) due to emission is negligible, for its
denominator $(\rho + \nu)^2$ vanishes nowhere. This means that the
transition is almost invariably associated with the absorption of
a photon whose frequency lies very close to $\nu$. The probability
that the atom will appear in the higher state n' after lapse of
time t increases in proportion with t; the factor

$$\frac{4\pi^2}{3}\,|\mathfrak{q}_{nn'}|^2\,\frac{U(\nu)}{(h\nu)^2}$$

is the probability that the transition $n \to n'$ take place in unit time.
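
The value $\pi t$ of the integral in (13.7), on which the proportionality to t rests, can be checked numerically. What follows is only a sketch and not part of the original text: the function name `integral_estimate`, the cutoff and the number of steps are illustrative assumptions, and because the infinite range is truncated the estimate falls short of $\pi t$ by roughly 2/cutoff.

```python
import math

def integral_estimate(t, cutoff=2000.0, steps=200_000):
    """Midpoint-rule estimate of the integral of (1 - cos(x t)) / x^2
    over the finite range [-cutoff, cutoff]."""
    width = 2.0 * cutoff / steps
    total = 0.0
    for k in range(steps):
        # Midpoints never land on x = 0, where the integrand tends to t^2 / 2.
        x = -cutoff + (k + 0.5) * width
        total += (1.0 - math.cos(x * t)) / (x * x)
    return total * width

for t in (1.0, 2.0):
    # The two printed numbers should agree up to the truncation error.
    print(t, integral_estimate(t), math.pi * t)
```

Doubling t doubles the estimate, which is the linear growth in time asserted in the text.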
This formula was obtained for the case in which the state
n' possessed a higher energy level than n. In the reverse case
only the sum (13.6e) due to emissions contributes an appreciable
amount. We now put $\nu_{nn'} = \nu_n - \nu_{n'} = \nu$ and obtain the same
formula with this difference: in place of $n_\alpha$ we now have $n_\alpha + 1$,
or in place of the sum (13.5) the sum

$$\sum h(n_\alpha + 1)\rho_\alpha = \sum h\,n_\alpha\rho_\alpha + \sum h\rho_\alpha.$$

The first is $V \cdot U(\rho)\,d\rho$, and we denote the second by $V \cdot u(\rho)\,d\rho$.
This latter is equal to $(h\rho)$ times the number of modes of vibration
of the cavity within the frequency interval $\rho$, $\rho + d\rho$; hence
by (13.1)

$$V \cdot u(\rho)\,d\rho = V \cdot \frac{h\rho^3\,d\rho}{\pi^2 c^3}, \qquad u(\nu) = \frac{h\nu^3}{\pi^2 c^3}.$$

The probability that the atom drop from state n into the lower state
n' in unit time is given by

$$\frac{4\pi^2}{3}\,|\mathfrak{q}_{nn'}|^2\,\frac{U(\nu) + u(\nu)}{(h\nu)^2}.$$

The additional term $u(\nu)$ is characteristic for spontaneous
emission. When the radiation is not enclosed in a black
body, i.e. when there is no radiation density $U(\nu)$, the probability
that the atom drop from the state n to the lower state n'
in unit time, emitting thereby a photon whose frequency lies
in the immediate neighbourhood of $\nu = \nu_n - \nu_{n'}$, is

$$\frac{4\pi^2}{3}\,|\mathfrak{q}_{nn'}|^2\,\frac{u(\nu)}{(h\nu)^2}.$$
This agrees with the formula obtained by integrating (8.11)
over all directions. The probability that the atom jump from
the level n into a higher level n' ($\nu_{n'} > \nu_n$) under these same
conditions is zero.

In the energy field of the black body radiation we find not
only absorption, but also "stimulated emission", both of which
are proportional to the energy density $U(\nu)$. On setting

$$A_{nn'} = \frac{4\pi^2}{3}\,\frac{|\mathfrak{q}_{nn'}|^2}{(h\nu)^2}, \tag{13.8}$$

the probability for a jump from state n to a higher state n' in
unit time is

$$A_{nn'}\,U(\nu) \qquad (\nu = \nu_{n'} - \nu_n), \tag{13.9}$$

and the probability for the inverse jump, the drop from n'
to n, is

$$A_{n'n}\,[U(\nu) + u(\nu)]. \tag{13.9'}$$

Since $\|\mathfrak{q}_{nn'}\|$ is an Hermitian matrix,

$$A_{nn'} = A_{n'n}. \tag{13.10}$$

If there are a number of atoms in the radiation field and the
whole system is in a steady state, then on the average as many
atoms must make the jump $n \to n'$ in unit time as make the
inverse jump $n' \to n$. On denoting the number of atoms in
the state n by $N_n$, these considerations are expressed in the
condition

$$A_{nn'}\,N_n\,U(\nu) = A_{n'n}\,N_{n'}\,[U(\nu) + u(\nu)]$$

or

$$\frac{N_n}{N_{n'}} = 1 + \frac{u(\nu)}{U(\nu)}. \tag{13.11}$$


The probability coefficients $A_{nn'} = A_{n'n}$ have entirely
disappeared, or rather almost entirely, for the equation is valid
only under the assumption that $A_{nn'} \neq 0$ or $\mathfrak{q}_{nn'} \neq 0$, i.e.
the transition $n \rightleftarrows n'$ is not to be forbidden by the selection
rules. But for such a system in thermal equilibrium $N_n$ must,
as shown by Boltzmann, be proportional to

$$e^{-h\nu_n/k\theta},$$

where $\theta$ is the temperature and k the Boltzmann constant.
Equation (13.11) then becomes

$$e^{h(\nu_{n'} - \nu_n)/k\theta} = 1 + \frac{u(\nu)}{U(\nu)},$$

or the Planck radiation formula:

$$U(\nu) = \frac{u(\nu)}{e^{h\nu/k\theta} - 1} = \frac{h\nu^3}{\pi^2 c^3}\cdot\frac{1}{e^{h\nu/k\theta} - 1};$$

this formula is valid for all frequencies whose energies can be
exchanged by the absorbing and emitting atoms in accordance
with Bohr's frequency condition.
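
The step from the steady-state condition (13.11) to the Planck formula can be illustrated numerically. The sketch below is an illustration only; the names `planck_density` and `population_ratio`, the unit value chosen for $u(\nu)$ and the sample values of $x = h\nu/k\theta$ are assumptions made for the example.

```python
import math

def planck_density(u, x):
    """Energy density U that solves exp(x) = 1 + u / U,
    where x = h*nu / (k*theta) and u stands for u(nu)."""
    return u / math.expm1(x)

def population_ratio(u, U):
    """Right-hand side of (13.11): N_n / N_n' = 1 + u / U."""
    return 1.0 + u / U

u = 1.0  # u(nu) in arbitrary units (assumption for illustration)
for x in (0.01, 1.0, 5.0):
    U = planck_density(u, x)
    # The last two columns should coincide: the Boltzmann ratio is recovered.
    print(x, U, population_ratio(u, U), math.exp(x))
```

For small x the density behaves like u/x, i.e. it grows in proportion to the temperature.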

We have thus finally returned to the historical origin of the
quantum theory. We must now add three remarks concerning
this treatment, due to Dirac, of energy exchange between matter
and radiation. In the first place, it is able to explain the fact
that the spectral lines are not sharp, but possess a natural breadth.
Secondly, we must inquire what causes this difference between
absorption and emission, processes which are transformed into
each other on changing the direction of time. Indeed, the
fundamental mechanical and field laws are invariant under the
transformation $t \to -t$! The answer is that this difference is
due to the preferential direction in time involved in the application
of the theory of probability; we assume a fixed initial state and
calculate, with the aid of transition probabilities, the distribution
over the various states at a later time, not the distribution
which would result from the equations for an earlier time. If
no assumption is made concerning this preferential direction,
t should be replaced by $|t|$ in (13.7). And finally, the fact that
we have here treated Maxwell's equations as classical equations
of motion, and as such have subjected them to the process of
quantization, may give rise to serious doubts, for in our general
formulation Maxwell's equations are already the quantum-theoretic
wave equations for the photon! But we shall see
in Chap. IV, § 11, that this method is in fact the correct one
to employ in order to go from one corpuscle to an indefinite
number of corpuscles. For since the number of photons must
remain indefinite (as a photon can, in contrast to an electron,
spring into being or disappear) the method of composition
described in § 10 is not applicable to them.

GROUPS AND THEIR REPRESENTATIONS

§ 1. Transformation Groups

The concept of a group, one of the oldest and most
profound of mathematical concepts, was obtained by
abstraction from that of a group of transformations.

A point-field, a domain of elements which we call points,
on which the transformations operate, underlies the transformations.
This point-field may be either the totality of a
finite number of individually exhibited elements or an infinite
set, in particular a continuum such as space or time. A
mapping or correspondence S of the point-field on itself is
determined by a law which associates with each point p of the
field a point p' as image: $p \to p' = Sp$; two correspondences
S and T are identical if for all points p the two image
points Sp and Tp coincide. If the point-field contains a finite
number of elements the correspondence S can be defined by
giving explicitly the image point for each point p; for infinite
sets, however, the association is only possible by giving the
law of the function S.

Among such correspondences there is a particular one which
associates with each point p the point p itself: $p \to p$; it is
called the identity I. Two correspondences can be applied
successively: if the first sends the arbitrary point p into $p' = Sp$,
the second p' into $p'' = Tp'$, then the correspondence resulting
from the composition of the two is defined by the association
$p \to p'' = T(Sp)$ and is denoted by TS (read from right to left!).
The resultant correspondence depends on the order of the two
factors S and T. In order that composition be possible it is
essential that the correspondences are ones which map the
point-field on itself, and not on another point-field.

We shall restrict ourselves to one-to-one correspondences S:
the image points $p' = Sp$ associated with p shall always be
distinct, and each given point p' shall appear as the image of
one (and only one) of the points p. Consequently such a one-to-one
correspondence $S: p \to p'$ determines a second, the inverse
$S^{-1}: p' \to p$ of S, which just cancels it:

$$S^{-1}(Sp) = p, \quad S(S^{-1}p') = p' \quad \text{or} \quad S^{-1}S = I, \quad SS^{-1} = I.$$

The inverse of $S^{-1}$ is again S and the identity I is its own inverse.
The resultant TS of two one-to-one correspondences S, T is
itself one-to-one, and its inverse is $(TS)^{-1} = S^{-1}T^{-1}$, for
on inverting the correspondences $p \to p' \to p''$ there results
$p'' \to p' \to p$. Henceforth we shall consider only those correspondences,
also called transformations or substitutions, which
are one-to-one. In this domain we have, in accordance with
what has been said, the two fundamental operations of inversion
and composition.
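
The rules for inversion can be verified on a small example. In this sketch a one-to-one correspondence of {0, ..., n-1} is stored as a tuple whose entry p is the image of p; this encoding, the sample tuples and the helper names are assumptions for the illustration.

```python
def compose(second, first):
    """Resultant 'second first': apply `first`, then `second`."""
    return tuple(second[first[p]] for p in range(len(first)))

def inverse(s):
    """Inverse correspondence: sends the image s[p] back to p."""
    inv = [0] * len(s)
    for p, image in enumerate(s):
        inv[image] = p
    return tuple(inv)

I = (0, 1, 2, 3)
S = (1, 2, 0, 3)
T = (0, 3, 2, 1)

print(inverse(compose(T, S)) == compose(inverse(S), inverse(T)))  # (TS)^-1 = S^-1 T^-1
```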

Examples. 1. Let the point-field consist of n elements
exhibited individually; bring them into a particular order by
numbering them with the integers

$$1, 2, \ldots, n. \tag{1.1}$$

This numbering consists in a one-to-one reciprocal relation
between the elements of the point-field and the integers or
possible "positions" in the series (1.1). A permutation consists
in the transition from one such arrangement to another.
If we wish to operate in space we may think of the positions as
fixed compartments into which the movable elements can be
laid, or, conversely, we may think of the elements as fixed and
shift the movable numbers about. With each permutation is
associated a one-to-one correspondence $p \to p'$ which tells
which element p' occupies, after the exchange, the position
previously held by p. Insofar as the method of numbering is
considered as left to convention, the permutation is nothing
more than this one-to-one correspondence. The concept is to
be understood in this way when we are concerned with the
composition or successive application of permutations.

2. A kinematical example of a group is offered by the motions 
of a space-filling substance, in particular those of a rigid body. 
The positions or numbers of the preceding example are here 
represented by the material points and the point-field is the 
space itself. The one-to-one correspondence $p \to p'$ connects
the initial with the final state: that material point which originally
covered the spatial point p is taken to the point p' by the
motion. Congruent correspondences of space on to itself will
also be briefly referred to as "motions" in the geometrical
sense.

The concept of a group of transformations is now readily
formulated. We understand by it any system $\mathfrak{G}$ of transformations
of a given point-field, which is closed in the sense of the
following conditions:

1. It contains the identity;

2. If S belongs to $\mathfrak{G}$, then its inverse $S^{-1}$ does also;

3. The resultant TS of any two transformations S, T of $\mathfrak{G}$
is also a transformation of $\mathfrak{G}$.

As examples we name the group of all n! permutations of n
things, the congruent mappings or "motions" of 3-dimensional
Euclidean space, all homogeneous linear transformations in
n variables with non-vanishing determinants (affine correspondence
of an n-dimensional vector space) and the group of unitary
transformations in n dimensions.
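
The three closure conditions can be tested mechanically for a finite system of permutations. The following sketch is illustrative only: the function `is_transformation_group`, the tuple encoding of permutations and the counter-example `not_closed` are assumptions.

```python
from itertools import permutations

def compose(second, first):
    """Resultant 'second first': apply `first`, then `second`."""
    return tuple(second[first[p]] for p in range(len(first)))

def inverse(s):
    inv = [0] * len(s)
    for p, image in enumerate(s):
        inv[image] = p
    return tuple(inv)

def is_transformation_group(system, n):
    """Check conditions 1 to 3 for a system of permutations of {0, ..., n-1}."""
    system = set(system)
    identity = tuple(range(n))
    return (identity in system
            and all(inverse(s) in system for s in system)
            and all(compose(t, s) in system for s in system for t in system))

all_perms = set(permutations(range(3)))           # the 3! = 6 permutations of 3 things
not_closed = {(0, 1, 2), (1, 0, 2), (0, 2, 1)}    # identity and two exchanges only

print(is_transformation_group(all_perms, 3), is_transformation_group(not_closed, 3))
```

The second system fails condition 3, since the resultant of the two exchanges is a cyclic permutation that it does not contain.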

If the point p goes over into p' by means of a transformation
of the group then p' is said to be equivalent to p (with respect
to the group $\mathfrak{G}$). The same concept is applied when we are
considering instead of a point p a figure consisting of points.
Expressed in these terms, the three requirements for a group
are nothing other than the three axioms of equality:

1. p is equivalent to p;

2. If p' is equivalent to p, then p is equivalent to p';

3. If p' is equivalent to p and p'' to p', then p'' is equivalent
to p.

According to Klein's Erlanger Program any geometry of
a point-field is based on a particular transformation group $\mathfrak{G}$
of the field; figures which are equivalent with respect to $\mathfrak{G}$,
and which can therefore be carried into one another by a transformation
of $\mathfrak{G}$, are to be considered as the same. In Euclidean
geometry this role is played by the group of congruency transformations,
consisting of the motions referred to above, and
in affine geometry by the group of affine transformations, etc.
The group expresses the specific isotropy or homogeneity of the
space; it consists of all one-to-one "isomorphic correspondences"
of the space on itself, i.e. those transformations which leave
undisturbed all objective relations between points of the space
which can be expressed geometrically. The symmetry of a
particular figure in such a space is described by a sub-group of
$\mathfrak{G}$ consisting of all transformations of $\mathfrak{G}$ which carry the figure
over into itself. The art of ornamental tiling, which was perfected
by the Egyptians, contains implicitly considerable knowledge
of a group-theoretic nature; we here find, perhaps, the
oldest fragment of mathematics in human culture. But only
recently have we been able to formulate clearly the formal
principles of this art; attempts in this direction were already
made by Leonardo da Vinci, who sought to give a general and
systematic account of the various types of symmetry possible
in a building. But the most wonderful symmetrical structures
are exhibited in crystals, the symmetry of which is described
by those congruency transformations of Euclidean space which
bring the atomic lattices of the crystal into coincidence with
themselves. The most important application of group theory
to natural science heretofore has been in this field.

The following considerations fit naturally into the present
discussion. Let the point-field M on which the transformations
S of the group $\mathfrak{G}$ operate be mapped on the point-field N by
means of the one-to-one correspondence $A: p \to q$; the case
in which the correspondence serves to introduce new numbering
or new co-ordinates is of particular importance. Through this
correspondence of M on N the transformation S of M becomes
a transformation T of N; in the particular case mentioned above
T is simply a description of the transformation S in the new
co-ordinates. It is evident that to the composition of transformations
S corresponds the composition of the corresponding
transformations T of N and that a group $\mathfrak{G}$ of transformations S
goes over into a group $\mathfrak{H}$ of transformations T. The relation
between these two transformations is

$$T = ASA^{-1}, \tag{1.2}$$

for if we denote the transformation S by $p \to p'$ and if q, q' are
the points of N associated with p, p' by A, then the transformation
$q \to q'$ of N is effected by

$$q \xrightarrow{A^{-1}} p \xrightarrow{S} p' \xrightarrow{A} q'.$$

We may also write $\mathfrak{H} = A\mathfrak{G}A^{-1}$. In particular, these considerations
apply when N and M are the same point-field.
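
Relation (1.2) amounts to a relabelling, which a short sketch makes concrete. The point-fields M = {0, 1, 2} and N = {"x", "y", "z"}, the particular maps and the helper `transport` are assumptions chosen for the illustration.

```python
S = {0: 1, 1: 2, 2: 0}            # a transformation of M = {0, 1, 2}
R = {0: 1, 1: 0, 2: 2}            # a second transformation of M
A = {0: "x", 1: "y", 2: "z"}      # one-to-one correspondence A: M -> N
A_inv = {q: p for p, q in A.items()}

def compose(second, first):
    """Resultant 'second first': apply `first`, then `second`."""
    return {p: second[first[p]] for p in first}

def transport(S):
    """T = A S A^-1, built as the chain q -> p -> p' -> q'."""
    return {q: A[S[A_inv[q]]] for q in A_inv}

T = transport(S)
print(T)  # {'x': 'y', 'y': 'z', 'z': 'x'}
```

Composition is carried over as well: transporting the resultant RS gives the resultant of the two transported transformations.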

§ 2. Abstract Groups and their Realization 

An arbitrary number of transformations of a given point-field 
on to itself can be applied successively ; we are of course not 
restricted to merely two. But when we perform this process 
step by step it is automatically reduced to a succession of com- 
positions of transformations taken two at a time : 

$$ABC \cdots = A[B(C \cdots)].$$

This possibility of performing an extended composition in steps
involving but two transformations at a time shows that the
associative law

$$(AB)C = A(BC)$$

holds for any three transformations A, B, C.

The structure of a transformation group is obtained from it
by abstraction when we allow the transformations themselves
to degenerate into elements of an immaterial nature, retaining
only their individuality and the rules in accordance with which
two given transformations are composed, in a given order, to
form a third. In accordance with what has been said such
composition necessarily obeys the associative law. Perhaps it
also obeys other universal laws, but since we have at present
no indication of this we attempt a formulation of the abstract
structure of the group by means of the following definitions:

An abstract group is a system of elements within which a law
of composition is given such that by means of it there arises from
any two (the same or different) elements a, b of the group, taken in
this order, an element ba. The following conditions shall thereby
be satisfied:

1. The associative law $c(ba) = (cb)a$;

2. There shall exist an element I, the unit element, which leaves
an arbitrary element a unaltered on composition with it:

$$Ia = aI = a.$$

3. To each element a there shall exist an inverse $a^{-1}$ which yields on
composition with it the unit element I:

$$aa^{-1} = a^{-1}a = I.$$

Such an abstract group is not to be confused with its realization
by transformations, i.e. by one-to-one correspondences of
a given point-field. A realization consists in associating with
each element a of the abstract group a transformation T(a) of the
point-field in such a way that to the composition of elements of
the group corresponds composition of the associated transformations:

$$T(ba) = T(b)T(a). \tag{2.1}$$

It follows from this that to the unit element I corresponds the
identity I and to inverse elements a, $a^{-1}$ correspond inverse
transformations:

$$T(a^{-1}) = T^{-1}(a). \tag{2.2}$$

The first assertion follows from the particular case

$$T(a)T(I) = T(a)$$

of (2.1) by left-handed composition with the reciprocal of the
transformation T(a); (2.2) is then contained in (2.1) as the
particular case $b = a^{-1}$. The realization is said to be faithful
when to distinct elements of the group correspond distinct
transformations:

$$T(a) \neq T(b) \quad \text{when} \quad a \neq b.$$

In accordance with the fundamental equation (2.1) the necessary
and sufficient condition for "faithfulness" is that T(a) shall be
the identity only if a is the unit element. For if a, b are two
elements of the group it then follows from T(a) = T(b), i.e.

$$T(a)T^{-1}(b) = T(a)T(b^{-1}) = T(ab^{-1}) = I,$$

that under these conditions $ab^{-1} = I$, i.e. a = b. If the abstract
group is obtained from a transformation group $\mathfrak{G}$ by abstraction,
then conversely $\mathfrak{G}$ is a faithful realization of it.

In the study of transformation groups we always deal with 
two manifolds, the structureless point-field and the manifold of 
group elements, the structure of which is expressed by the law 
of composition. The original problem thus resolves itself into 
two ; the examination of the various group structures possible 
and the examination of the possibility of obtaining realizations 
of the given abstract group by transformations of a given point- 
field. The historical development of the subject has shown that 
it is advantageous to effect this division into two problems ; 
they are of fundamentally different character and require 
fundamentally different mathematical equipment for their 
discussion. 

In accordance with our method of introducing the abstract
group, which we henceforth refer to simply as the group, it
serves merely to give the structure of the group; the nature of
its elements is immaterial. This abstraction from the nature
of the elements is expressed mathematically by the concept of
isomorphism. If we have two groups $\mathfrak{g}$, $\mathfrak{g}'$ and there is associated
with each element a of $\mathfrak{g}$ an element a' of $\mathfrak{g}'$ in a one-to-one
way: $a \rightleftarrows a'$, such that

$$(ba)' = b'a', \tag{2.3}$$

then the two groups are said to be simply isomorphic. Simply
isomorphic abstract groups offer no means of distinguishing one
from the other. The concept of isomorphism can, of course, be
applied to transformation groups. Two isomorphic transformation
groups can be considered as faithful representations of
one and the same abstract group. A group may be isomorphic
with itself; it is then said to be automorphic. Such an automorphism
occurs when $\mathfrak{g}$ and $\mathfrak{g}'$ coincide, i.e. when a one-to-one
reciprocal association $a \rightleftarrows a'$ satisfying the condition (2.3) is
established between the elements of the group $\mathfrak{g}$.

The question arises whether or not every abstract group
possesses a faithful realization. If this were not the case the
concept of an abstract group as developed above would be too
broad: there would exist, in addition to the associative law,
other purely formal laws for the composition of transformations
which are satisfied by every transformation group. Conversely,
a proof of the realizability of any abstract group would tell us
that all that can be said about the formal laws for the composition
of transformations is contained in our conditions (1)
to (3). We can, in fact, construct a faithful realization of any
abstract group $\mathfrak{g}$ by taking as the point-field the group manifold
itself and letting correspond to each element a of the group
the transformation

$$s \to s' = as$$

of the group manifold on to itself. This "left-translation"
$t_a$ is obviously a one-to-one reciprocal transformation which
has as inverse the transformation $s' \to s = a^{-1}s'$. If a and b are
distinct elements the corresponding transformations $t_a$, $t_b$ are
distinct, for they allow the unit element I to correspond to the
distinct elements a, b respectively. If we perform in succession
two left-translations

$$s \to s' = as, \qquad s' \to s'' = bs'$$

the resulting transformation is, in consequence of the associative
law,

$$s \to s'' = b(as) = (ba)s.$$

Consequently the left-translations constitute in fact a faithful
realization of the abstract group. However, the right-translations
behave otherwise, for if we denote the mapping
$s \to s' = sa$ of the group manifold on itself by $t^*(a)$, we find
instead of (2.1) the equation

$$t^*(ba) = t^*(a)\,t^*(b).$$
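
The behaviour of left- and right-translations can be checked on the group of the six permutations of three things. The sketch below is only an illustration; the encoding of group elements as tuples and the names `left`, `right` and `compose_maps` are assumptions.

```python
from itertools import permutations

group = list(permutations(range(3)))

def mul(b, a):
    """Group element ba: apply a first, then b."""
    return tuple(b[a[p]] for p in range(3))

def left(a):
    """Left-translation s -> as, as a map of the group manifold on itself."""
    return {s: mul(a, s) for s in group}

def right(a):
    """Right-translation s -> sa."""
    return {s: mul(s, a) for s in group}

def compose_maps(second, first):
    return {s: second[first[s]] for s in first}

# Left-translations obey (2.1); right-translations reverse the order of the factors.
print(all(left(mul(b, a)) == compose_maps(left(b), left(a)) for a in group for b in group))
print(all(right(mul(b, a)) == compose_maps(right(a), right(b)) for a in group for b in group))
```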

§ 3. Sub-groups and Conjugate Classes 

A sub-group $\mathfrak{g}'$ of a given abstract group $\mathfrak{g}$ is a set of elements
contained in $\mathfrak{g}$ which itself fulfils the characteristic group conditions:
the unit element I belongs to $\mathfrak{g}'$, with a belongs also
$a^{-1}$ and with a, b also ba. These three conditions can be reduced
to the one: if a, b are any two elements of $\mathfrak{g}'$, then $ba^{-1}$ also
belongs to $\mathfrak{g}'$. We assume, of course, that the partial system
consists not merely of the element I, but the other limiting
case, in which $\mathfrak{g}'$ coincides with $\mathfrak{g}$, shall be included under the
concept of a sub-group.

Examples are readily found. In the group of Euclidean
motions are contained, for example, the group of rotations
(which leaves one point, the centre, fixed) and the group of
translations. The unitary transformations constitute a sub-group
of the complete group of all homogeneous linear transformations;
the even permutations a sub-group of the group of all
permutations. If we are dealing with a transformation group
$\mathfrak{G}$, all those transformations of $\mathfrak{G}$ which leave a particular
point p fixed (i.e. which carry p over into itself) constitute a
sub-group. Instead of a point p the fixed element may be
any figure composed of points; the transformations of the sub-group
must either leave the figure as a whole fixed (i.e. they must
carry each point of the figure over into another such) or satisfy the
more restrictive condition that they leave each point of the
figure fixed. We can also obtain sub-groups of $\mathfrak{G}$ by employing
invariant functions instead of invariant figures. If $\psi(p)$ is any
function of position on the point-field with elements p we associate
with the transformation $S: p \to p'$ the function $\psi'$
defined by $\psi'(p') = \psi(p)$ and say that it is obtained from $\psi$ by
the transformation S. If $p' = Sp$, $p'' = Tp'$, the equations

$$\psi(p) = \psi'(p') = \psi''(p'')$$

show that the composition of the transitions $\psi \to \psi'$ and
$\psi' \to \psi''$ associated with S and T result in the transition $\psi \to \psi''$
associated with TS. Now consider all transformations S of $\mathfrak{G}$
which carry $\psi(p)$ over into itself, i.e. for which $\psi(Sp) = \psi(p)$ is
an identity in p; they constitute a sub-group $\mathfrak{H}$ of $\mathfrak{G}$, and
$\psi(p)$ is an invariant of $\mathfrak{H}$. In this way we can separate out
the rotations from the homogeneous linear transformations by
requiring the invariance of the unit quadratic form. The sub-groups
contained in a finite group $\mathfrak{g}$, which is described by
exhibiting each of its elements and giving explicitly the result
of composition of each two, can be obtained by inspection.

There is associated with each element a of the group $\mathfrak{g}$ a
cyclic sub-group denoted by (a):

$$\ldots, a^{-2}, a^{-1}, a^0 = I, a^1, a^2, \ldots, \tag{3.1}$$

the elements $a^n$ of which are defined inductively by the equations

$$a^0 = I, \qquad a^{n+1} = a^n a.$$

These elements constitute in fact a group, for n and m being
any integral exponents we have

$$a^n a^m = a^{n+m}.$$

(a) is the smallest sub-group which contains a, i.e. its elements
are common to all sub-groups of $\mathfrak{g}$ which contain a. The
elements of the set (3.1) can either be distinct or (and this
latter must be the case if $\mathfrak{g}$ is a finite group) they must repeat
themselves after a cycle of h terms: $I, a, a^2, \ldots, a^{h-1}$ are
distinct but $a^h = I$. h is called the order of the element a.
The order of a finite group is the number of its elements;
accordingly, the order of an element a agrees with the order of
the cyclic sub-group (a) generated by a. A group is said to be
commutative or Abelian if composition of its elements obeys
the rule $ba = ab$. Cyclic groups are therefore Abelian.
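
The cyclic sub-group (a) and the order of a can be computed by forming successive powers until the unit element recurs. The sketch below assumes permutations stored as tuples; the sample element `a` and the function name `cyclic_subgroup` are illustrative.

```python
def mul(b, a):
    """Group element ba: apply a first, then b."""
    return tuple(b[a[p]] for p in range(len(a)))

def cyclic_subgroup(a):
    """Return [I, a, a^2, ..., a^(h-1)], where h is the order of a."""
    identity = tuple(range(len(a)))
    powers = [identity]
    current = a
    while current != identity:
        powers.append(current)
        current = mul(current, a)   # a^(n+1) = a^n a
    return powers

# A permutation of five things: a cycle of three things together with an exchange.
a = (1, 2, 0, 4, 3)
powers = cyclic_subgroup(a)
print(len(powers))  # order of a
```

Any two of the powers commute, in agreement with the remark that cyclic groups are Abelian.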

If a runs through the sub-group $\mathfrak{h}$ of $\mathfrak{g}$ the associated (left-)
translations $t_a$ constitute a group of transformations which is
simply isomorphic with $\mathfrak{h}$, the point-field of which is the group
manifold. We say that two elements s, s' which are equivalent
with respect to this transformation group are (left-)equivalent
with respect to $\mathfrak{h}$ and express this situation by the notation
"$s' \equiv s$ with respect to $\mathfrak{h}$"; the condition for it is that $s' = as$
where a is an element of $\mathfrak{h}$. In this way the elements of $\mathfrak{g}$ are
divided into sets of elements which are equivalent with respect to $\mathfrak{h}$. If
the number of such sets is finite, it is called the index of $\mathfrak{h}$ in $\mathfrak{g}$.
If $\mathfrak{g}$ is a finite group the number of elements in each of these
sets is given by the order of $\mathfrak{h}$, for different translations $t_a$ send
s into different elements: $as \neq bs$ if $a \neq b$. The order of $\mathfrak{h}$ is
accordingly a divisor of the order of $\mathfrak{g}$, and the quotient of these two
is the index of $\mathfrak{h}$.
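
The division of a finite group into sets of equivalent elements, and the resulting relation between order and index, can be exhibited for the permutations of three things. In this sketch the sub-group `h` (the unit element and one exchange) and the helper `equivalence_sets` are assumptions for the illustration.

```python
from itertools import permutations

group = list(permutations(range(3)))

def mul(b, a):
    """Group element ba: apply a first, then b."""
    return tuple(b[a[p]] for p in range(3))

h = [(0, 1, 2), (1, 0, 2)]   # sub-group: unit element and one exchange

def equivalence_sets(group, h):
    """Sets {as : a in h}; s' and s are equivalent if s' = as with a in h."""
    return {frozenset(mul(a, s) for a in h) for s in group}

sets = equivalence_sets(group, h)
print(len(group), len(h), len(sets))  # order of the group, order of h, index of h
```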

The considerations at the end of § 1 above, which were
developed for groups of transformations, suggest a second
realization of the abstract group $\mathfrak{g}$. We associate with the
element a the correspondence

$$s \to s' = asa^{-1} \tag{3.2}$$

of the group manifold on itself. This correspondence, which
we call the "conjugation", is reciprocal one-to-one, and has
as inverse $s = a^{-1}s'a$. The law of composition is obeyed, for
from

$$s \to s' = asa^{-1}, \qquad s' \to s'' = bs'b^{-1}$$

we obtain the product

$$s \to s'' = basa^{-1}b^{-1} = (ba)s(ba)^{-1}.$$

Two elements s, s' of $\mathfrak{g}$ are said to be conjugate if they are
equivalent with respect to the group of all conjugations. Accordingly,
the whole group is divided into classes, any element
of one of which is conjugate to any other element of the same
class. When we speak of classes within a group without a
more explicit description we mean these conjugate classes.
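
The classes of conjugate elements can be listed for a small group by applying every conjugation to every element. The sketch below does this for the permutations of three things; the tuple encoding and the function name `conjugate_classes` are assumptions.

```python
from itertools import permutations

group = list(permutations(range(3)))

def mul(b, a):
    """Group element ba: apply a first, then b."""
    return tuple(b[a[p]] for p in range(3))

def inverse(s):
    inv = [0] * len(s)
    for p, image in enumerate(s):
        inv[image] = p
    return tuple(inv)

def conjugate_classes(group):
    """Each class is the set { a s a^-1 : a in the group } of one element s."""
    return {frozenset(mul(mul(a, s), inverse(a)) for a in group) for s in group}

classes = conjugate_classes(group)
print(sorted(len(c) for c in classes))  # sizes of the classes
```

The unit element forms a class by itself, the two cyclic permutations form a second class, and the three exchanges a third.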

The realization of $\mathfrak{g}$ by the group of conjugations is in general
a "contracted" rather than a faithful realization. In particular,
the conjugation (3.2) coincides with the identity if a commutes
with all elements s of the group. The totality of all such elements
a is called the central of the group; it is obviously
an Abelian sub-group of $\mathfrak{g}$. But this disadvantage of conjugation
over translation is offset by an advantage; conjugation
is an isomorphic correspondence within the group itself which
leaves the unit element invariant and which associates with
each sub-group $\mathfrak{h}$ of $\mathfrak{g}$ another such, the conjugate sub-group
$a\mathfrak{h}a^{-1}$. These facts, which are expressed by the equation

$$a(st)a^{-1} = (asa^{-1})(ata^{-1}),$$

were already contained implicitly in the considerations at the
end of § 1. $\mathfrak{h}$ is said to be a self-conjugate or invariant
sub-group if it coincides with all its conjugate sub-groups.

The importance of this last concept is best seen in the
following:

Theorem. If $\mathfrak{h}$ is an invariant sub-group and $\equiv$ denotes equivalence
with respect to it, then it follows from

$$s' \equiv s, \ t' \equiv t \quad \text{that} \quad s't' \equiv st. \tag{3.3}$$

To prove this we note that $s' = as$, $t' = bt$ (a, b in $\mathfrak{h}$) yield

$$s't' = asbt = (ac)(st), \tag{3.4}$$

for $c = sbs^{-1}$ belongs to $\mathfrak{h}$ with b. Since ac lies in $\mathfrak{h}$ our assertion
is proven. It is readily seen that the invariantive nature of $\mathfrak{h}$ is
necessary as well as sufficient for the validity of (3.3). In dealing
with an invariant sub-group $\mathfrak{h}$ we need not distinguish
between right and left equivalence with respect to $\mathfrak{h}$; indeed,
the above proof was based on this fact.

We may, if we like, consider equivalent elements as not
differing from one another (by application of the principle of
definition by abstraction); but by thus allowing equivalent
elements to fall together the group property of $\mathfrak{g}$ is, in general,
forfeited. In accordance with the above theorem it still remains,
however, if $\mathfrak{h}$ is an invariant sub-group. The group obtained
from $\mathfrak{g}$ by identifying all elements which are equivalent with
respect to $\mathfrak{h}$ is called the factor group $\mathfrak{g}/\mathfrak{h}$; its order is the
index of the invariant sub-group $\mathfrak{h}$ of $\mathfrak{g}$.
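
That composition of equivalence sets is well defined exactly when the sub-group is invariant can be tested on the permutations of three things. The sketch is illustrative: the sub-groups `even` (the even permutations) and `exchange` (unit element and one exchange) and all helper names are assumptions.

```python
from itertools import permutations

group = list(permutations(range(3)))

def mul(b, a):
    """Group element ba: apply a first, then b."""
    return tuple(b[a[p]] for p in range(3))

def inverse(s):
    inv = [0] * len(s)
    for p, image in enumerate(s):
        inv[image] = p
    return tuple(inv)

def is_invariant(h):
    """h coincides with all its conjugate sub-groups a h a^-1."""
    return all(mul(mul(a, x), inverse(a)) in h for a in group for x in h)

def class_of(s, h):
    """All elements equivalent to s with respect to h."""
    return frozenset(mul(a, s) for a in h)

def product_well_defined(h):
    """Condition (3.3): s' ~ s and t' ~ t imply s't' ~ st."""
    return all(class_of(mul(s2, t2), h) == class_of(mul(s, t), h)
               for s in group for t in group
               for s2 in class_of(s, h) for t2 in class_of(t, h))

even = {(0, 1, 2), (1, 2, 0), (2, 0, 1)}   # the even permutations
exchange = {(0, 1, 2), (1, 0, 2)}          # unit element and one exchange

factor_group = {class_of(s, even) for s in group}
print(is_invariant(even), product_well_defined(even), len(factor_group))
print(is_invariant(exchange), product_well_defined(exchange))
```

For the even permutations the factor group has order 2, the index of the sub-group; for the non-invariant sub-group the composition of sets breaks down.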

These concepts are of assistance in examining the way in
which a group may be “contracted” on setting up a realization.
Let the transformation T(a) of a given point-field on itself
correspond to the element a of the abstract group g in the
realization under consideration. Then T(a) = T(a') if and only if a'
is obtained from a by composition with an element e (i.e. a' = ea)
for which T(e) is the identity. Such elements e obviously
constitute a sub-group h of g, for it follows from

T(e) = I, T(e') = I that T(ee') = T(e)T(e') = I.

h is, in fact, an invariant sub-group, for if T(e) is the identity,
the same is true of

T(aea^{-1}) = T(a)T(e)T^{-1}(a) = T(a)T^{-1}(a).

In any realization of an abstract group g by a group of
transformations the elements of a certain invariant sub-group h of g
correspond to the identical transformation; two different elements
will be associated with the same transformation if and only if they
are equivalent with respect to h. The group of transformations is
consequently a faithful realization of the factor group g/h.

§ 4. Representation of Groups by Linear 
Transformations 

On requiring that the transformations which are to serve 
as a realization of a given abstract group g be linear and homo- 
geneous we arrive at a problem which is most fruitful from the 
mathematical standpoint and which is at the same time of 
greatest importance for quantum mechanics; we then speak of
a representation, instead of a realization, of the group. An
n-dimensional representation of g, or a representation of degree n,
consists in associating with each element s of the group an
affine transformation U(s) of the n-dimensional vector space
R = R_n in such a way that these transformations obey the
law of composition

U(s)U(t) = U(st). (4.1)

We then say that s induces the transformation U(s) in the
representation space R. On choosing a definite co-ordinate
system in R each transformation U(s) is represented by a square
matrix of n rows and columns, the determinant of which does
not vanish. On replacing the original co-ordinate system by
another, obtained from it by the transformation A, the
correspondence which was formerly represented by the matrix U(s)
is now represented by the matrix AU(s)A^{-1}. Consequently if
the association s → U(s) is a representation, the association

s → AU(s)A^{-1}

is obviously also one; this latter representation is said to be
equivalent to the former. They are essentially the same,
differing only in the choice of the co-ordinate system in terms
of which they are described.
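
That a change of co-ordinates leaves the law of composition (4.1)
intact can be seen on a small numerical case. The following sketch
assumes, purely for illustration, the cyclic group of order 4
represented by quarter-turn rotations of the plane and an arbitrarily
chosen non-singular matrix A; none of these particular choices come
from the text.

```python
from fractions import Fraction


def matmul(X, Y):
    """Product of two square matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def inverse_2x2(M):
    """Exact inverse of a 2x2 matrix with non-vanishing determinant."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    return [[d / det, -b / det], [-c / det, a / det]]


def power(M, r):
    """r-th power of a 2x2 matrix, r >= 0."""
    result = [[1, 0], [0, 1]]
    for _ in range(r):
        result = matmul(result, M)
    return result


ROT = [[0, -1], [1, 0]]      # quarter turn; ROT^4 is the identity
A = [[1, 1], [0, 1]]         # assumed change of co-ordinates
A_INV = inverse_2x2(A)


def U(r):
    """Representation a^r -> ROT^r of the cyclic group of order 4."""
    return power(ROT, r % 4)


def U_equiv(r):
    """The equivalent representation a^r -> A U(a^r) A^{-1}."""
    return matmul(matmul(A, U(r)), A_INV)


def is_representation(rep):
    """Check (4.1) for all pairs, using a^r a^t = a^((r+t) mod 4)."""
    return all(matmul(rep(r), rep(t)) == rep((r + t) % 4)
               for r in range(4) for t in range(4))
```

Both `U` and `U_equiv` pass `is_representation`, although their
matrices differ; exact fractions are used so that the comparison is
not disturbed by rounding.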

Examples. A representation in one dimension consists in
assigning to each element s of the group a non-vanishing number
χ(s) in such a way that

χ(st) = χ(s) χ(t). (4.2)

In particular, χ(I) = 1. A most trivial 1-dimensional
representation is obtained by assigning to each s the number 1:
χ(s) = 1. This special case is called the identical representation.

Consider next the so-called symmetric group, the group
π = π_f of all f! permutations of f things. The association

s → δ_s = ± 1,

according as s is an even or an odd permutation, defines a
1-dimensional representation, the “alternating” representation
of the group π. For the character δ_s, which distinguishes
between the even and the odd permutations, satisfies the
equation

δ_st = δ_s δ_t.

Let g be a finite cyclical group of order h; the elements
s are then

I, a, a^2, ..., a^{h-1}

and a^h = I. Consider the 1-dimensional representation s → χ(s)
in which χ(a) = ε; the condition (4.2) for a representation
then tells us that to the elements s of this series correspond

1, ε, ε^2, ..., ε^{h-1}

and that to a^h corresponds ε^h. Hence ε^h = 1; ε must therefore
be an h-th root of unity and the law defining the representation
is a^r → ε^r (r = 0, 1, 2, ...). Conversely, when ε is an arbitrary
h-th root of unity this association defines a 1-dimensional
representation of g. We have thus obtained a complete survey
of all possible 1-dimensional representations of a cyclical group.
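
This survey is easily reproduced numerically. The sketch below lists
the h associations a^r → ε^r, one for each h-th root of unity ε, and
tests condition (4.2) up to rounding error; the function names and the
tolerance are assumptions made for the example.

```python
import cmath


def cyclic_characters(h):
    """Return the h one-dimensional representations of a cyclic group of order h.

    Each is the list [chi(a^0), ..., chi(a^(h-1))] with chi(a^r) = eps**r,
    where eps runs through the h-th roots of unity.
    """
    characters = []
    for k in range(h):
        eps = cmath.exp(2j * cmath.pi * k / h)
        characters.append([eps ** r for r in range(h)])
    return characters


def satisfies_4_2(chi, h, tol=1e-9):
    """Check chi(st) = chi(s) chi(t), using a^r a^t = a^((r+t) mod h)."""
    return all(abs(chi[(r + t) % h] - chi[r] * chi[t]) < tol
               for r in range(h) for t in range(h))
```

An assignment such as a^r → 2^r, in which 2 is not a root of unity,
fails the same test, since a^h = I would then have to correspond to
2^h instead of 1.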
The only example of a multi-dimensional representation
which we offer at this time is the following trivial one. If
g is itself a group of linear transformations of an n-dimensional
vector space then the association s → s defines an n-dimensional
representation of g. This example implies more than one might
at first sight imagine. We have in fact to do the following:
we first obtain the structure of the group g by abstraction from
the group of linear transformations and then return to the
original realization by means of the correspondence s → s
between an element s of the abstract group on the one hand
and the linear transformation s on the other.

The concept of equivalence has a more general significance
than that discussed above. It may refer to an arbitrary system
Σ of linear correspondences U of the n-dimensional vector
space R. We need not assume that these correspondences
possess an inverse (i.e. that they have a non-vanishing
determinant), nor need we assume that they are associated with
the elements s of a group, as is the case with representations.
On expressing the set of correspondences U in terms of a new
co-ordinate system each matrix U goes over into the matrix
U' = AUA^{-1}; the system Σ is transformed into the equivalent
system Σ' consisting of the U'. A is here a fixed non-singular
matrix.

Consider a correspondence U of R on to itself. A linear
sub-space R' of R is said to be invariant under U if the vectors
of R' are transformed into vectors of R' by U. If R' is invariant
then the space R (mod. R') obtained by projecting R with
respect to R' is also invariant (cf. I, § 2, in particular Fig. 1).
R' being invariant, U gives rise to a correspondence U' of R'
on to itself; we say that U induces U' in R'. Similarly for
the space obtained by projection. We now pass from a single
correspondence U to a system Σ of correspondences. R' is
said to be invariant under Σ if it is invariant under each
correspondence U of Σ. Describing R in terms of a co-ordinate
system which is adapted to the invariant sub-space R', all
matrices U of the system Σ reduce simultaneously to the form
illustrated in Fig. 1, p. 8. Σ is called irreducible if R contains
no sub-space, other than R itself and the space 0 consisting
only of the vector 0, which is invariant under Σ. We shall
have occasion to reduce R in such a way that each constituent
separated off is irreducible under a given system Σ. This
requires the construction of a series of sub-spaces

0, R_1, R_2, ..., R (4.3)

beginning with 0 and ending with R, in which each member
is contained in the following one and is such that R_i (mod. R_{i-1})
is irreducible. Naturally R_i shall actually be larger than R_{i-1},
not merely coincide with it. The implications of this reduction
are most readily seen in terms of the matrices U of the
correspondences of the system Σ on adapting the co-ordinate system
to the “composition series” (4.3), i.e. by choosing first a
co-ordinate system in R_1, then supplementing it with additional
fundamental vectors in order to obtain a co-ordinate system
for R_2, R_3, ... in turn.

Σ is said to be completely reducible if R can be decomposed
into two sub-spaces R' + R'', each of which is invariant under Σ
and such that neither of them consists merely of the vector 0.
This concept of complete reducibility is more exacting than that
of mere reducibility. On describing R in terms of a co-ordinate
system which is adapted to this decomposition, each matrix
U of Σ assumes the form illustrated in Fig. 2, p. 9. We are
then faced with the problem of decomposing R (or Σ) into
constituents, none of which is completely reducible, i.e. of
decomposing R = R_1 + ... + R_k into invariant sub-spaces,
none of which is completely reducible.

We often find that reducibility implies complete reducibility,
i.e. that in many cases we have the theorem: If R' is an
invariant sub-space of R, a second invariant sub-space R'' can
be found such that R is completely reducible (with respect to
Σ) into R' + R''. We shall soon see that this is actually the
case when R is a unitary space and Σ is a system of unitary
transformations.

It was shown in Chap. I, § 3, that if the system Σ is
reducible, then the system Σ* of “transposed” correspondences
of the dual space on itself is also reducible. If H : s → U(s)
is an n-dimensional representation of the group g the transposed
U*(s) do not constitute a representation; it is readily seen,
however, that on employing instead the contragredient
correspondences

Ŭ(s) = U*(s^{-1})

we do obtain a representation s → Ŭ(s) of the dual vector space.
This we call the contragredient representation.
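
The reason the transposed matrices fail while the contragredient ones
succeed is that transposition reverses the order of a product. A
minimal sketch, assuming two arbitrarily chosen unimodular 2x2
matrices S and T as stand-ins for U(s) and U(t):

```python
from fractions import Fraction


def matmul(X, Y):
    """Product of two square matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def transpose(M):
    """Transposed matrix."""
    return [list(row) for row in zip(*M)]


def inverse_2x2(M):
    """Exact inverse of a 2x2 matrix with non-vanishing determinant."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    return [[d / det, -b / det], [-c / det, a / det]]


def contragredient(M):
    """Transpose of the inverse, i.e. the matrix U*(s^{-1}) when M = U(s)."""
    return transpose(inverse_2x2(M))


S = [[1, 1], [0, 1]]   # assumed example matrix U(s)
T = [[1, 0], [1, 1]]   # assumed example matrix U(t)
ST = matmul(S, T)
```

Here `transpose(ST)` equals the product of the transposes taken in the
reverse order only, whereas `contragredient(ST)` equals
`contragredient(S)` times `contragredient(T)` in the original order.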

§ 5. Formal Processes, Clebsch-Gordan Series 

Continuous groups offer what are perhaps the simplest 
examples of the theory of representations. We consider in 
particular the group c = c_n of all linear and homogeneous
transformations s in n variables x_1, x_2, ..., x_n with non-vanishing
determinants; we consider each set of values x_i as a vector
in an n-dimensional vector space r = r_n. The classical theory
of invariants, first developed in England about the middle of
the last century, concerned itself in particular with the
representations of c induced on the coefficients of arbitrary forms
in the variables x_i. A quadratic form in these variables is a
linear combination of the n(n + 1)/2 linearly independent
products x_i x_k; under the influence of a linear transformation
s of the x_i these products undergo a linear transformation [s]_2,
and the correspondence s → [s]_2 is obviously a representation
[c]_2 in n(n + 1)/2 dimensions of the group c. The transformation
s of the variables x_i sends the arbitrary quadratic form

Σ a_ik x_i x_k

into a quadratic form

Σ a'_ik x'_i x'_k

in the new variables, where the coefficients a'_ik are obtained
from the a_ik by a certain linear transformation associated with
s, which is obviously contragredient to [s]_2. The quadratic form
characterized by a fixed set of n(n + 1)/2 coefficients a_ik may
therefore be considered as a vector in a space of this number of
dimensions, and the transformation s of the variables induces
this contragredient transformation in this space. The space thus
defined by the totality of quadratic forms is thus the point-field
for a group of linear homogeneous transformations which constitute
a representation of the group c.

We may in the same way deal with cubic, quartic, ...,
f-ic forms. The totality of monomials of order f are contained
in the formula

x_1^{f_1} x_2^{f_2} ... x_n^{f_n} (5.1)

where the f_i are non-negative integers whose sum

f_1 + f_2 + ... + f_n = f.

They constitute the substratum of a representation [c]_f in

n(n + 1) ... (n + f - 1) / (1·2 ... f)

dimensions.
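
The number of dimensions just stated can be confirmed by listing the
monomials (5.1) directly. The sketch below enumerates the exponent
sets (f_1, ..., f_n) and compares their count with the quotient above,
which is the binomial coefficient "n + f - 1 over f"; the function
names are illustrative.

```python
from itertools import combinations_with_replacement
from math import comb


def monomials(n, f):
    """Exponent tuples (f_1, ..., f_n) of non-negative integers with sum f."""
    result = []
    for chosen in combinations_with_replacement(range(n), f):
        exponents = [0] * n
        for index in chosen:      # each chosen variable raises its exponent by 1
            exponents[index] += 1
        result.append(tuple(exponents))
    return result


def dimension_formula(n, f):
    """n(n+1)...(n+f-1) / (1*2*...*f), i.e. the binomial coefficient C(n+f-1, f)."""
    return comb(n + f - 1, f)
```

For n = 3 and f = 2 both counts give the six products x_i x_k of the
quadratic case, n(n + 1)/2 = 6.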

But we can exhibit representations of c which are formally
yet simpler than these arising from the theory of forms. Let
(x_i) and (y_i) be two arbitrary vectors in our n-dimensional
space r and consider the n^2 products x_i y_k. On subjecting the
x_i and the y_k to the same transformation s of c (transition to a
new co-ordinate system) the n^2 products undergo a certain
linear transformation s × s associated with s, and the
correspondence s → s × s is an n^2-dimensional representation (c)^2
of c. Now a system of numbers F(i, k), depending on two indices i, k
which run through the values 1, 2, ..., n, is said to be a tensor
of second order if under the influence of a transformation s of
r the F(i, k) undergo the same transformation as the products
x_i y_k of the components of two arbitrary vectors x, y of r. Hence
the tensors of order 2 are the substratum of the representation
(c)^2 of c. (c)^2 contains the representation [c]_2 which is induced
in the sub-space of symmetric tensors of order 2; the tensor
with components F(i, k) being symmetric if F(ik) = F(ki).

In geometry the anti-symmetric tensors, i.e. tensors whose
components satisfy the condition F(ik) = -F(ki), play a more
important role than the symmetric ones. In particular, two
arbitrary vectors (x_i), (y_i) define a surface element with
components

x{ik} = x_i y_k - x_k y_i;

of these quantities but n(n - 1)/2 are linearly independent,
say those for which i < k. On subjecting the components x_i
of the vector x and the components y_i of the vector y to the
same linear transformation s, the components of the surface
element defined by them undergo an n(n - 1)/2-dimensional
linear transformation {s}_2. s → {s}_2 is a representation {c}_2
whose substratum is the totality of anti-symmetric tensors of
order 2. Hence the representation (c)^2 is reduced into the
representations [c]_2 and {c}_2, for any tensor F(ik) can obviously
be written

F(ik) = ½[F(ik) + F(ki)] + ½[F(ik) - F(ki)],

i.e. in a unique manner as the sum of its symmetric and
anti-symmetric parts. That this reduction is correct is further
borne out by the fact that the dimensionalities satisfy

n^2 = n(n + 1)/2 + n(n - 1)/2.
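
The splitting of a tensor of order 2 into its symmetric and
anti-symmetric parts can be written out directly. The sketch below
assumes that the tensor is stored as an n-by-n list of rows and uses
exact fractions for the factor ½; the names are illustrative.

```python
from fractions import Fraction


def split(F):
    """Return (symmetric part, anti-symmetric part) of a square array F(i, k)."""
    n = len(F)
    half = Fraction(1, 2)
    sym = [[half * (F[i][k] + F[k][i]) for k in range(n)] for i in range(n)]
    anti = [[half * (F[i][k] - F[k][i]) for k in range(n)] for i in range(n)]
    return sym, anti


def independent_components(n):
    """Numbers of independent components: (symmetric, anti-symmetric)."""
    return n * (n + 1) // 2, n * (n - 1) // 2
```

The two parts add up to the original tensor, and the two counts
returned by `independent_components(n)` add up to n^2.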

Similarly three arbitrary vectors x, y, z determine a
3-dimensional element of volume with components

          | x_i  x_k  x_l |
x{ikl} =  | y_i  y_k  y_l |      (5.2)
          | z_i  z_k  z_l |

These elements constitute the substratum of a representation
{c}_3 in

n(n - 1)(n - 2) / (1·2·3)

dimensions. Continuing in this way we can construct 4-,
5-, ..., n-dimensional elements; this process must cease with
n-rowed determinants, for a determinant of the form (5.2) with
more than n rows must necessarily vanish identically.

We shall see that the representations of c whose substrata
are the symmetric and anti-symmetric tensors of order f are
irreducible, and shall in fact solve the general problem of
effecting the complete reduction of (c)^f, the representation
induced by c in the space of all tensors of order f, into its
irreducible constituents (Chap. V).

The tensor concept really depends on the ×-multiplication
introduced in II, § 10. If the m variables x_i undergo a
transformation A and the n variables y_k a transformation B, then
the mn products x_i y_k undergo a transformation A × B.
Considering the x_i as the components of an arbitrary vector x in
an m-dimensional space R_m and the y_k as the components of
y in R_n, the products x_i · y_k may be considered as the components
of a vector x × y in an mn-dimensional vector space R_m × R_n.
Hence two representations

H : s → U(s), H' : s → U'(s) (5.3)

of g in m, n dimensions, respectively, give rise to a new
mn-dimensional representation which we denote by H × H':

H × H' : s → U(s) × U'(s). (5.4)

This presents a general method of obtaining a new representation
H × H' from two given representations H, H'.

Denoting the representation s → s of the linear group c for
the moment by (c), the representations of c whose substrata
are the tensors of order 2, 3, ... are then (c) × (c) = (c)^2,
(c) × (c) × (c) = (c)^3, ....
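
That (5.4) really satisfies the law of composition rests on the rule
(A × B)(C × D) = AC × BD for the ×-multiplication of matrices. A small
sketch, assuming that the products x_i y_k are ordered with the index
i varying slowest; the matrices are arbitrary examples.

```python
def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def kron(A, B):
    """Matrix of A x B acting on the products x_i y_k, pairs (i, k) in lexicographic order."""
    return [[A[i][j] * B[k][l]
             for j in range(len(A[0])) for l in range(len(B[0]))]
            for i in range(len(A)) for k in range(len(B))]


A = [[1, 2], [3, 4]]
C = [[0, 1], [1, 1]]
B = [[1, 0, 2], [0, 1, 0], [1, 1, 1]]
D = [[2, 0, 0], [1, 1, 0], [0, 0, 1]]
```

With U(s) = A, U(t) = C and U'(s) = B, U'(t) = D, the rule states that
the matrix belonging to st in H × H' is the product of the matrices
belonging to s and to t.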

We should, perhaps, have discussed the addition of two
representations before discussing their multiplication ×.
Consider the variables x_i and y_k as the components of a single
vector in an (m + n)-dimensional vector space; when the x_i are
subjected to the transformation A and the y_k to the
transformation B these m + n variables undergo a certain
transformation [A, B]. Hence we obtain from (5.3) the representation

H + H' : s → [U(s), U'(s)]

in m + n dimensions. The inverse of this process is complete
reduction, as discussed above: H + H' is completely reducible
into the components H and H'.

Another important formal method is the following: Any
representation F in N dimensions of the linear group c_n in
n dimensions may be used to construct an N-dimensional
representation of any abstract group g from an n-dimensional
representation H of the same. F associates with the linear
transformation u in n-dimensional space a linear transformation
U in N dimensions, so if H : s → u is an n-dimensional
representation of the group g with elements s, then

s → u → U

is an N-dimensional representation s → U of g which we may
denote by F(H). To this is due the importance of the
representations of the linear group for the general theory of
representations. For example, take F to be the representation of
c whose substratum is the dual space, the space of all tensors
of order 2, of the symmetric or anti-symmetric tensors of order 2,
etc.; we then obtain from the representation H of the abstract
group g the contragredient representation, H × H, [H × H],
{H × H}, etc.

The three most important formal processes are (1) addition,
(2) ×-multiplication, and (3) the F process. The first two
generate a new representation from one or two given
representations, the third a new one from a given representation.
The first two are completely circumscribed, but the third
contains a general method, for F may be any representation of
the linear group c_n.

If g' is a sub-group of g, then any representation
H : s → U(s) of g contains a representation of g'; we need only
let the element s run through the sub-group g'. This too may
be considered as a formal process (4) which generates a
representation of g' from a given representation of g.

The ×-multiplication occurs in yet another connection.
Given two groups g, g', we can consider the pairs (s, s'), the
first member s of which is an element of g and the second s'
an element of g', as the elements of a new group g × g', the
direct product of g and g', obeying the multiplication law

(s, s')(t, t') = (st, s't').

The order of g × g' is the product of the orders of g and g'. If
H : s → U(s) is an n-dimensional representation of g and
H' : s' → U'(s') an n'-dimensional representation of g', then

(s, s') → U(s) × U'(s') (5.5)

is obviously a representation in nn' dimensions of the group
g × g'; we denote it by H × H' (with a boldface ×). This
construction may be broken up into two steps. First introduce
the representation

(s, s') → U(s)

of g × g'; there is no reason why we should not designate it
by the same letter H as the representation s → U(s) of g; we are
accustomed to calling the function f(x), considered as a function
of the two variables x, y, by the same letter as the function
f(x) of the single variable x. U(s) and U'(s') are thus to be
considered as functions of the same variable pair (s, s'), and then
the representation H × H' of g × g' may be obtained by ordinary
×-multiplication from H and H'. The differentiation between
boldface × and ordinary × is accordingly purely pedantic.

Examples. Unimodular Group in Two Dimensions

Let g = c = c_2 consist of all linear transformations s of two
variables x, y:

x' = ax + by, y' = cx + dy, (5.6)

whose determinant ad - bc = 1 (“unimodular” linear
transformations *). A homogeneous polynomial in x, y of order f is
a linear combination of the f + 1 monomials

x^f, x^{f-1} y, ..., x y^{f-1}, y^f. (5.7)

Under the influence of s they undergo a linear transformation
which we denoted above by [s]_f; they constitute the substratum
of a representation [c]_f : s → [s]_f in f + 1 dimensions which we
now denote by C_f. C_f is, although we have yet to prove it,
irreducible.

We can restrict ourselves within c to the sub-group c_1 of
“principal” transformations which transform each of the
variables separately:

x' = αx, y' = (1/α) y, (5.8)

where α ≠ 0 is an arbitrary constant. c_1 is Abelian. This
transformation multiplies the monomials of the set (5.7) by

α^f, α^{f-2}, ..., α^{-f}.

On associating the number α^r with the element (5.8) of c_1 we
obtain a 1-dimensional representation which we denote for the
moment by C^(r); here r can be any fixed integral exponent.
We have just seen that the irreducible representation C_f of c_2
is completely reduced on restricting ourselves to the sub-group
c_1 into f + 1 one-dimensional representations C^(r) with r = f,
f - 2, ..., -f. This is an example of the process (4).
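
The exponents r = f, f - 2, ..., -f can be read off by applying (5.8)
to each monomial of (5.7). The sketch below carries out this check
with exact fractions; the particular values of α, x and y used in
testing it are arbitrary assumptions.

```python
from fractions import Fraction


def weights(f):
    """Exponents r such that x^(f-k) y^k is multiplied by alpha^r, k = 0, ..., f."""
    return [f - 2 * k for k in range(f + 1)]


def check_monomial_scaling(f, alpha, x, y):
    """Compare (alpha x)^(f-k) (y/alpha)^k with alpha^r x^(f-k) y^k for every k."""
    for k, r in enumerate(weights(f)):
        transformed = (alpha * x) ** (f - k) * (y / alpha) ** k
        expected = alpha ** r * x ** (f - k) * y ** k
        if transformed != expected:
            return False
    return True
```

For f = 4 the list of exponents is 4, 2, 0, -2, -4, one for each of
the five monomials.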

As an example of multiplication and addition we consider
the problem of reducing the product C_f × C_g of the two
representations C_f, C_g of c into its irreducible components. The
result is contained in the formula

C_f × C_g = Σ_v C_v (5.9)

where v runs through the series

v = f + g, f + g - 2, ..., |f - g| (5.10)

without repetition, decreasing by 2 from term to term. This
equation is essentially identical with the Clebsch-Gordan series
which plays such an important role in the theory of invariants
of binary forms. We shall see in the succeeding chapters that
it may justly be considered as the fundamental mathematical
formula for the classification of atomic spectra and for the theory
of the valence bond.

* c_n will usually denote the group of all non-singular linear
transformations in n dimensions; it will however occasionally be
used to denote the more restricted unimodular group, in which case
the restriction will be explicitly stated.
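
Formula (5.9) can be tested for consistency, though of course not
proved, by restricting both sides to the sub-group of principal
transformations (5.8): the exponents r occurring in C_f × C_g are the
sums of one exponent from C_f and one from C_g, and they must coincide,
with multiplicities, with the exponents of the C_v in the series
(5.10). The sketch below performs this count; the function names are
illustrative.

```python
from collections import Counter


def weights(f):
    """Exponents r = f, f-2, ..., -f occurring in C_f under (5.8)."""
    return [f - 2 * k for k in range(f + 1)]


def clebsch_gordan_series(f, g):
    """The values v = f+g, f+g-2, ..., |f-g| of (5.10)."""
    return list(range(f + g, abs(f - g) - 1, -2))


def product_weights(f, g):
    """Exponents of C_f x C_g with multiplicities."""
    return Counter(r + s for r in weights(f) for s in weights(g))


def series_weights(f, g):
    """Exponents of the sum of the C_v over the series (5.10), with multiplicities."""
    return Counter(r for v in clebsch_gordan_series(f, g) for r in weights(v))
```

For f = 3 and g = 1 the series consists of v = 4 and v = 2, and the
dimensions agree: 4 · 2 = 5 + 3.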

The proof consists in showing that

C_f × C_g = C_{f+g} + (C_{f-1} × C_{g-1}), (5.11)

for (5.9) then follows by mathematical induction and the fact
that obviously

C_f × C_0 = C_f.

A new co-ordinate system for the representation space of
C_f is obtained by replacing the basis (5.7) of homogeneous
polynomials of order f by another basis. In this sense we can say
that the polynomials of order f constitute the substratum of
the representation C_f. The substratum of the representation
C_f × C_g is then the totality of polynomials

Φ = Φ(x y; ξ η)

depending on the components of two arbitrary vectors (x, y),
(ξ, η), homogeneous and of order f in the first, and homogeneous
and of order g in the second; we write the total order f + g = h.
The Φ are thus linear combinations of the (f + 1)(g + 1)
monomials

x^i y^k ξ^ι η^κ where i + k = f, ι + κ = g. (5.12)

Both vectors are transformed cogrediently under the same
transformation (5.6). The problem consists in completely reducing
the space of the polynomials Φ into two sub-spaces (Φ)_0 and
(Φ)' which are the substrata of the representations C_h and
C_{f-1} × C_{g-1} respectively. We first discuss the structure of
these two sub-spaces.

(Φ)_0. Expand

(αx + βy)^f (αξ + βη)^g = α^h · φ_0 + \binom{h}{1} α^{h-1} β · φ_1 + ... + β^h · φ_h (5.13)

in powers of the undetermined coefficients α, β. The
φ_i = φ_i(xy; ξη) are special polynomials of the type Φ and span
the sub-space (Φ)_0. We must now show that this sub-space is
invariant under the transformation (5.6) of the variables;
i.e. that φ'_i = φ_i(x'y'; ξ'η') is a linear combination of the
φ_i. It is clear that if this is the case then c induces the
representation C_h in (Φ)_0, for on identifying the two
vectors

ξ = x, η = y,

φ_i becomes

φ_i(xy; xy) = x^{h-i} · y^i. (5.14)

Hence we are certain a priori that the h + 1 functions φ_i are
linearly independent.

In order to arrive at the desired proof we replace x, y in
(5.13) by

x' = ax + by, y' = cx + dy,

and in the same way ξ, η by

ξ' = aξ + bη, η' = cξ + dη.

Now note that αx' + βy' is the linear form

(αa + βc)x + (αb + βd)y = Ax + By

in x and y; hence

(αx' + βy')^f (αξ' + βη')^g = (Ax + By)^f (Aξ + Bη)^g,

and by (5.13)

= Σ_i \binom{h}{i} A^{h-i} B^i · φ_i.

On replacing A, B on the right-hand side of this equation by

A = αa + βc, B = αb + βd

and equating coefficients of α^{h-i} β^i we obtain φ'_i as a linear
combination of the φ_i.
(Φ)'. The substratum of the representation C_{f-1} × C_{g-1}
consists of the polynomials

Ψ = Ψ(xy; ξη)

of order f - 1 in (x, y) and of order g - 1 in (ξ, η). They are not
polynomials of type Φ; in order to increase the order in the
components of each vector by 1 we replace each such Ψ by

Φ = (xη - yξ) · Ψ.

The factor thus introduced in no way affects the representation,
for under (5.6) the determinant xη - yξ is merely multiplied by
ad - bc = 1.

The last step in the proof consists in showing that the total
space of polynomials Φ is completely reducible into these two
sub-spaces; i.e. in showing that any polynomial Φ can be
written in the form

Φ = (a_0 φ_0 + a_1 φ_1 + ... + a_h φ_h) + (xη - yξ)Ψ (5.15)

with unique constant coefficients a_i. (The development in
terms of powers of the determinant xη - yξ obtained from this
by induction is the Clebsch-Gordan series.) First, the
dimensionalities are correct, for

(f + 1)(g + 1) = (f + g + 1) + fg.

Hence it suffices to show that the various terms in (5.15) are
linearly independent, i.e. that an expression of the form (5.15),
in which Ψ is a polynomial of order f - 1 in (x, y) and of order
g - 1 in (ξ, η), can vanish only if Ψ vanishes identically and if
all the coefficients a_i are zero. The proof is extremely simple.
We first let (ξη) = (xy) as in (5.14), then the equation Φ = 0
becomes

a_0 x^h + a_1 x^{h-1} y + ... + a_h y^h = 0

identically in x and y; hence a_i = 0. Having established this
we return to the two sets of variables xy; ξη and obtain the
equation

(xη - yξ)Ψ = 0,

from which it follows that Ψ = 0; in an algebraic identity
for polynomials we may always remove a factor, such as
xη - yξ, which does not vanish identically.

Our formula (5.9) also holds for the group c of all linear
transformations of x, y with non-vanishing determinant. We
must then interpret C_v, v = h - 2l, in (5.9) as that representation
whose substratum is the totality of homogeneous polynomials
of order v in x and y multiplied by (xη - yξ)^l. In other words,
the new C_v differs from the old in that the transformation of
the (v + 1)-dimensional representation space corresponding to
s in the representation C_v is to be multiplied by the l-th power
of the determinant ad - bc.

C_f × C_g is a representation of c_2 × c_2, the group consisting
of pairs (s, s') whose members s and s' run independently through
the entire group c_2. On introducing the restriction that s' is
the element s̄ obtained from s by replacing the coefficients of
the linear transformation s by their conjugate complex, C_f × C_g
becomes a representation C_{f,g} of c_2, the substratum of which
may be taken as the monomials

x^i y^k x̄^ι ȳ^κ (i + k = f, ι + κ = g)

of order f in (x, y) and order g in (x̄, ȳ). It can be shown that
C_{f,g} is also irreducible.

§ 6. The Jordan-Hölder Theorem and its Analogues

Perhaps the most fundamental theorem of mathematics is 
that on which the concept of cardinal numbers depends. Let 
the members of a finite set of objects distinguished by marks
a, b, c, ... be exhibited individually in this order and associated
with the symbols 1, 2, ..., n. The theorem then states that
the “number” n is independent of the order in which the
objects are exhibited. The proof of this theorem is of
considerable mathematical interest and offers the simplest example
of the type of proof employed in establishing the Jordan-Hölder
theorem. A new enumeration consists in associating
the symbol 1 with any one of the objects, the symbol 2 with
any one of the remaining objects, etc., until the entire set is
exhausted, the last object receiving the symbol n'. We assert
that n' = n.

The proof is divided into two steps. (1) If in the new enumer- 
ation the symbol 1 is associated with the same object a as in 
the old, our theorem for the series from 1 to n is reduced to that 
for the series from 1 to n — 1. This is immediately evident on 
discarding the object a and reducing by one the symbols
associated with the objects b, c, ... in the new as well as in the
old enumeration. (2) If, on the other hand, the symbol 1 is
associated with one of the other objects b, c, ..., then in the

new enumeration the object a is associated with some symbol 
i contained in the series 2, 3, • • •, n'. We now introduce a 
third enumeration which enables us to make the transition 
between the first and the second by interchanging the symbols 
1 and i in the second enumeration. The number n' is obviously 
unaltered by this process. But we have now introduced an 
equivalent enumeration in which the object a is associated with 
the same symbol 1 as in the original and have reduced the 
general case to the one considered in (1) above. The proof of 
the theorem then follows immediately by the method of
mathematical induction. 

As an auxiliary result of these fundamental considerations 
we have the theorem that any permutation can be obtained by 
the successive application of transpositions. 
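
The construction used in the proof translates into a short procedure:
bring the right object to the first place by one interchange, then
treat the remaining places in the same way. The sketch below assumes
that a permutation of n objects is given as a list containing each of
0, 1, ..., n - 1 once; the function names are illustrative.

```python
def transpositions(perm):
    """Return interchanges (i, j) of places that bring `perm` into the order 0, 1, ..., n-1."""
    work = list(perm)
    swaps = []
    for position in range(len(work)):
        if work[position] != position:
            other = work.index(position)          # place currently holding `position`
            work[position], work[other] = work[other], work[position]
            swaps.append((position, other))
    return swaps


def apply_swaps(n, swaps):
    """Start from 0, 1, ..., n-1 and undo the interchanges, last one first."""
    work = list(range(n))
    for i, j in reversed(swaps):
        work[i], work[j] = work[j], work[i]
    return work
```

Since each interchange is its own inverse, applying the recorded
interchanges in reverse order to the natural arrangement reproduces
the given permutation; at most n - 1 interchanges are needed.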

The Jordan-Hölder theorem is concerned with an abstract
group g. An invariant sub-group g' of g which does not coincide
with g itself is said to be maximal if there exists no invariant
sub-group of g, except g' and g, containing g'. The factor
group g/g' is then simple, i.e. it contains no invariant sub-group
with the exception of itself and that consisting only of the
unit element I. As was recognized by Galois, the so-called
composition series

g_0 = g, g_1, g_2, ..., g_{r-1}, g_r = I (6.1)

is of fundamental importance for the solution of algebraic
equations. This series begins with g and ends with I, and each
member is a maximal invariant sub-group of the preceding
member. We assume that the composition series terminates;
this is naturally the case for finite groups, as the order necessarily
decreases from term to term. The successive factor groups

g/g_1, g_1/g_2, ..., g_{r-1}/g_r = g_{r-1} (6.2)

are simple. The Jordan-Hölder theorem asserts that the
structure of these factor groups, except for the order in which they
appear, is uniquely determined by g.

Consider, therefore, a second composition series

g'_0 = g, g'_1, g'_2, ...

of the same group g; it is to be compared with the “standard
series” (6.1). The proof of the fact that this new series also
contains exactly r + 1 terms and that the corresponding factor
groups are, except for the order in which they occur, isomorphic
with the factor groups (6.2) is again accomplished in two steps.

(1) If the two second members g'_1, g_1 coincide, the theorem
for the group g, whose standard series contains r + 1 members,
is reduced to the corresponding theorem for the group g_1, whose
standard series contains but r members.

(2) If g_1 and g'_1 do not coincide we construct the
intersection h of g_1 and g'_1, i.e. the set consisting of all elements
common to the two. h is then an invariant sub-group of g'_1
and, as we shall prove, g'_1/h is isomorphic with g/g_1. That
two elements s, t of g are equivalent with respect to g_1, i.e. that
they belong to the same “set,” is expressed by the equation
t = a_1 s where a_1 is in g_1. If s and t are at the same time elements
of the sub-group g'_1, then a_1 is also in g'_1 and consequently it
is an element of h. We may therefore consider as the elements
of g'_1/h those sets in g which contain an element of g'_1. The
elements contained in these classes then constitute an invariant
sub-group k of g containing both g_1 and g'_1, and g'_1/h is simply
isomorphic with k/g_1. But since g'_1 is maximal either k = g
or k = g'_1. The second case implies that g_1 is contained in g'_1,
and since it is maximal it must coincide with g'_1, contrary to
assumption. Hence k coincides with g and our assertion is
proved. The intersection h of g_1 and g'_1 depends symmetrically
on both, whence g/g'_1 and g_1/h are also simply isomorphic.

We now proceed as follows. We construct a composition
series for h, which we denote simply by h, ..., and compare
the following four composition series of g:

g, g_1, g_2, ...

g, g_1, h, ...

g, g'_1, h, ...

g, g'_1, g'_2, ...

The comparison of the first and second series is reduced to case (1).
The second and third series agree from the member h on, and
the two foregoing factor groups

g/g_1, g_1/h

are, as we have seen, simply isomorphic with

g/g'_1, g'_1/h

on interchanging their order. The comparison between the
third and fourth series is again reduced to the case (1). The
proof of the theorem for composition series containing r + 1
members is thus reduced to the proof of the corresponding
theorem for series with but r members, and since it obviously
holds for r = 2 (i.e. for simple groups) the method of
mathematical induction establishes its general validity.
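
The content of the theorem is easily exhibited on cyclic groups. The
sketch below relies on two standard facts that are assumed here rather
than taken from the text: a cyclic group of order n has exactly one
sub-group of each order dividing n, and since the group is Abelian
every sub-group is invariant, so that a sub-group is maximal exactly
when its index is a prime. A composition series is then described by
the decreasing sequence of the orders of its members.

```python
def prime_divisors(n):
    """Distinct prime divisors of n in increasing order."""
    primes = []
    p = 2
    while p * p <= n:
        if n % p == 0:
            primes.append(p)
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        primes.append(n)
    return primes


def composition_series(order):
    """All chains order = d_0 > d_1 > ... > 1 of sub-group orders with prime indices."""
    if order == 1:
        return [[1]]
    chains = []
    for p in prime_divisors(order):
        for tail in composition_series(order // p):
            chains.append([order] + tail)
    return chains


def factor_orders(chain):
    """Orders of the successive factor groups, sorted to ignore their arrangement."""
    return sorted(chain[i] // chain[i + 1] for i in range(len(chain) - 1))
```

For the cyclic group of order 12 there are three composition series,
with member orders 12, 6, 3, 1; 12, 6, 2, 1; and 12, 4, 2, 1. Each has
the same number of members, and the factor groups have the orders
2, 2, 3 in some arrangement.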

The close methodological agreement between the construction 
involved in the proof of this theorem and that involved in the 
proof of the independence of the cardinal number of a set of 
the order in which the objects are enumerated is immediately 
evident. 

E. Noether has given a generalization of the Jordan-Hölder
theorem which is of importance for us. A correspondence
$s \to s' = As$ of the group $\mathfrak{g}$ on itself is said to be automorphic if
multiplication is invariant under it, i.e. if $(st)' = s't'$; we here
neither assume that different elements $s$ generate different
elements $s'$ nor that for a given element $s'$ there exists an element
$s$ such that $s \to s'$ in virtue of the automorphism. Let $\Sigma$ be
a system of such automorphic correspondences of $\mathfrak{g}$. We now
admit only sub-groups of $\mathfrak{g}$ which are invariant under $\Sigma$, i.e.
sub-groups whose elements are carried over by all operations
of the system $\Sigma$ into elements of the same sub-group. We say
that two such “allowed” sub-groups $\mathfrak{g}_1$ and $\mathfrak{g}_2$ have the same
structure if we can set up a one-to-one simple isomorphic
correspondence between the elements of the one and the
elements of the other in such a way that every operation $A$ of
the system $\Sigma$ sends corresponding elements of the two
sub-groups over into corresponding elements. The Jordan-Hölder
theorem still holds under this modification; its proof can be
taken over unaltered.

The vectors of an $n$-dimensional vector space $\mathfrak{R}$ constitute
an Abelian group whose multiplication is the addition + of
vectors. We must for the moment supplement addition by
the operation of multiplication of a vector by an arbitrary
number; hence the concepts and theorems applying to vector
space are not truly specializations of the concepts and theorems
of Abelian groups, but there exists a thorough-going analogy
between the two. Indicating this analogy between a group (on
the left) and vector space (on the right) by ~ we have, for
example, sub-group ~ linear sub-space, automorphism ~ linear
correspondence. Indeed, a linear sub-space $\mathfrak{R}'$ is a system of
vectors such that with $\mathfrak{x}$ and $\mathfrak{y}$ their sum $\mathfrak{x} + \mathfrak{y}$ and the product
$\lambda\mathfrak{x}$ by an arbitrary number $\lambda$ also belong to $\mathfrak{R}'$, and a
correspondence $\mathfrak{x} \to \mathfrak{x}' = A\mathfrak{x}$ is linear if it sends $\mathfrak{x} + \mathfrak{y}$ and $\lambda\mathfrak{x}$ over into
$\mathfrak{x}' + \mathfrak{y}'$ and $\lambda\mathfrak{x}'$, respectively. Every “sub-group” is here
invariant, as we are dealing with Abelian groups. If $\mathfrak{R}'$ is
a sub-space of $\mathfrak{R}$ the space $\mathfrak{R}$ (mod. $\mathfrak{R}'$) obtained by projecting
$\mathfrak{R}$ with respect to $\mathfrak{R}'$ is the exact analogue of a factor group.
A composition series consists of a sequence of spaces each
member of which is a linear sub-space of the preceding one
and has one less dimension. The last member is the space 0,
consisting of the vector 0 alone, and the number of members in
the series is 1 greater than the dimensionality $n$. The
Jordan-Hölder theorem is here valid but trivial.

On the other hand, this theorem is of considerable importance
on going over to Noether’s generalization. Consider a system
$\Sigma$ of linear correspondences of the vector space $\mathfrak{R}$ on itself; the
terms invariant, equivalent, reduction shall in the following refer
to this system. Two invariant sub-spaces $\mathfrak{R}_1$ and $\mathfrak{R}_2$ are similar
or equivalent if a one-to-one linear correspondence $\mathfrak{x}_1 \rightleftarrows \mathfrak{x}_2$ can
be set up between the vectors of the one and the vectors of the
other in such a way that any operation $A$ of the system sends
corresponding vectors over into corresponding vectors. On
reading the series (4.3) established in § 4 backwards, we have
the exact analogue of the composition series: each member of
the series is followed by a maximal sub-space which is invariant
under $\Sigma$. (The possibility of constructing the composition
series in increasing as well as decreasing order is due to the
fact that the addition of vectors is commutative.) Furthermore,
we can obtain the concepts and theorems relating to a system
$\Sigma$ of correspondences as genuine special cases of those of group
theory, and not merely as analogues, by supplementing the
system $\Sigma$ with all similarity transformations, i.e. by all
correspondences of the form $\mathfrak{x} \to \lambda\mathfrak{x}$ representing multiplication
by an arbitrary number $\lambda$. The Jordan-Hölder-Noether theorem
now states: Given a second composition series

$$0,\ \mathfrak{R}'_1,\ \mathfrak{R}'_2,\ \cdots,\ \mathfrak{R}, \qquad (6.3)$$

the corresponding projection spaces

$$\mathfrak{R}'_1,\quad \mathfrak{R}'_2\ (\text{mod. } \mathfrak{R}'_1),\quad \mathfrak{R}'_3\ (\text{mod. } \mathfrak{R}'_2),\ \cdots$$

are equivalent to the projection spaces (4.3)

$$\mathfrak{R}_1,\quad \mathfrak{R}_2\ (\text{mod. } \mathfrak{R}_1),\quad \mathfrak{R}_3\ (\text{mod. } \mathfrak{R}_2),\ \cdots$$

of the original series, taken in a suitable order. The number
of members is, of course, the same in both. The reader is
advised to reconstruct the proof of this theorem by carrying
through the proof of the Jordan-Hölder theorem step by step
for this case.

In particular, if the system $\Sigma$ consists of the transformations
$U(s)$ associated with the various elements $s$ of a group in a
representation $\mathfrak{H} : s \to U(s)$, our result yields the

Uniqueness theorem: The irreducible representations separated
off from $\mathfrak{H}$ by successive reduction are completely determined by $\mathfrak{H}$,
except for the order in which they occur, considering equivalent
representations as the same. In particular, the complete reduction
of $\mathfrak{H}$ into irreducible components is unique, always considering
equivalent representations as the same.

§ 7. Unitary Representations 

For the case in which the representation space $\mathfrak{R}$ is unitary
and the correspondences $U(s)$ of $\mathfrak{R}$ on itself, associated with
the element $s$ of the group under consideration, are also unitary,
certain of the concepts introduced above are to be modified
accordingly. Two representations

$$s \to U(s), \qquad s \to U'(s) = A\,U(s)\,A^{-1},$$

are to be considered as equivalent only if $A$ is unitary, i.e. if it
is a transformation from one normal co-ordinate system in
$\mathfrak{R}$ to another such. If $\mathfrak{R}'$ is a sub-space of $\mathfrak{R}$ a unitary-orthogonal
co-ordinate system can be set up in $\mathfrak{R}'$ and supplemented
by additional fundamental vectors to form a complete
unitary-orthogonal co-ordinate system for the entire space $\mathfrak{R}$ : every
sub-space of a unitary space is per se unitary. Invariance and
reduction remain as before, but we allow only those
decompositions of $\mathfrak{R}$ into two sub-spaces $\mathfrak{R}_1 + \mathfrak{R}_2$ in which $\mathfrak{R}_1$, $\mathfrak{R}_2$
are perpendicular. For a system of unitary correspondences
reducibility implies complete reducibility and we have the theorem:
If $\mathfrak{R}'$ is invariant with respect to $\Sigma$ then $\mathfrak{R}$ may be broken up into
$\mathfrak{R}' + \mathfrak{R}''$ in such a way that $\mathfrak{R}''$ is also invariant under $\Sigma$. We
need merely to define $\mathfrak{R}''$ as the space defined by all vectors
perpendicular to $\mathfrak{R}'$. The theorem naturally holds for the case in
which $\Sigma$ is a system of infinitesimal unitary correspondences or,
what amounts to the same, a system of Hermitian forms. The
theorem developed in the preceding section proves that these
irreducible components are uniquely determined, in the sense
of (unitary) equivalence, to within a permutation.
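
The construction of the perpendicular space can be checked on a small case. The Python sketch below is only an illustration under stated assumptions: the cyclic permutation of three co-ordinates is chosen as a sample unitary correspondence, a shear of the plane as a sample non-unitary one, and all names are hypothetical.

```python
def apply(matrix, vector):
    """Image of a vector under the linear correspondence given by a matrix."""
    return [sum(a * x for a, x in zip(row, vector)) for row in matrix]

def inner(u, v):
    """Unitary inner product, conjugate-linear in the first argument."""
    return sum(complex(a).conjugate() * b for a, b in zip(u, v))

def is_perpendicular(u, v, tol=1e-12):
    return abs(inner(u, v)) < tol

# A unitary correspondence: cyclic permutation of three co-ordinates.
U = [[0, 0, 1],
     [1, 0, 0],
     [0, 1, 0]]
invariant = [1, 1, 1]                    # spans an invariant sub-space
complement = [[1, -1, 0], [0, 1, -1]]    # spans the perpendicular space

# The perpendicular space is carried into itself by U.
unitary_case = all(is_perpendicular(invariant, apply(U, v)) for v in complement)

# A non-unitary correspondence for contrast: a shear of the plane.
S = [[1, 1],
     [0, 1]]
# The vector (1, 0) spans an invariant sub-space, but the image of the
# perpendicular vector (0, 1) is no longer perpendicular to it.
shear_case = is_perpendicular([1, 0], apply(S, [0, 1]))

print(unitary_case, shear_case)
```

The shear shows why unitarity matters here: it is reducible, yet the perpendicular space is not invariant under it.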
Examples


(1) The Unitary Group in Two Dimensions 

The group $\mathfrak{c} = \mathfrak{c}_2$ of linear transformations in two dimensions
contains the sub-group $\mathfrak{u} = \mathfrak{u}_2$ of unitary transformations.
Hence the representation $\mathfrak{C}_f$ of $\mathfrak{c}$ obtained in § 5 is also a
representation of $\mathfrak{u}$. This representation is not unitary as it stands,
but it can readily be made unitary by a slight change. The
transformation of $\mathfrak{C}_f$ corresponding to the unitary
transformation $s$ of the co-ordinates $x$, $y$ is that induced by $s$ on the monomials

$$X_n = x^i y^k \qquad (i + k = f) \qquad (7.1)$$

of order $f$. For purposes of symmetry we label these co-ordinates
with the index $n = i - k$ which runs through the values
$f, f - 2, \cdots, -f$. This is also desirable because on restricting
ourselves to the sub-group of “principal transformations”

$$x \to \varepsilon x, \qquad y \to \frac{1}{\varepsilon}\, y$$

$X_n$ is multiplied by the factor $\varepsilon^n$. We now employ, instead of
(7.1), the variables

$$x_n = \frac{x^i y^k}{\sqrt{i!\,k!}} \qquad (7.2)$$

obtained from them by multiplication with a constant. The
representation of $\mathfrak{u}$ will then be unitary, as follows from the
equation

$$\frac{1}{f!}(x\bar{x} + y\bar{y})^f = \sum_{i+k=f} \frac{x^i \bar{x}^i\, y^k \bar{y}^k}{i!\,k!} = \sum_n x_n \bar{x}_n.$$
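
The role of the factors $1/\sqrt{i!\,k!}$ can be checked numerically. The following Python sketch is an illustration and not part of the text: it assumes a unitary transformation of the form $x' = \alpha x + \beta y$, $y' = -\bar\beta x + \bar\alpha y$ with $\alpha\bar\alpha + \beta\bar\beta = 1$, sample numerical values, and hypothetical function names.

```python
import cmath
import math

def monomials(x, y, f):
    """Normalized monomials x^i y^k / sqrt(i! k!) with i + k = f, as in (7.2)."""
    return [x**i * y**(f - i) / math.sqrt(math.factorial(i) * math.factorial(f - i))
            for i in range(f, -1, -1)]

def norm_sq(values):
    """Sum of the squared absolute values."""
    return sum(abs(v) ** 2 for v in values)

def unitary_step(alpha, beta, x, y):
    """x' = alpha x + beta y, y' = -conj(beta) x + conj(alpha) y
    (assumes |alpha|^2 + |beta|^2 = 1)."""
    return alpha * x + beta * y, -beta.conjugate() * x + alpha.conjugate() * y

f = 3
x, y = 0.3 + 0.4j, -1.2 + 0.5j                 # sample values
alpha = math.cos(0.7) * cmath.exp(0.2j)
beta = math.sin(0.7) * cmath.exp(-1.1j)

before = norm_sq(monomials(x, y, f))
after = norm_sq(monomials(*unitary_step(alpha, beta, x, y), f))
closed_form = (abs(x) ** 2 + abs(y) ** 2) ** f / math.factorial(f)
print(before, after, closed_form)
```

The three printed numbers agree to rounding error: the sum of squares of the normalized monomials equals $(x\bar{x} + y\bar{y})^f / f!$ and is therefore unchanged by the unitary transformation.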

We call $\mathfrak{C}_f$ even or odd according as $f$ is even or odd. The even
representations associate the identity 1 with the reflection

$$x \to -x, \qquad y \to -y,$$

and the odd associate with it the transformation $-1$. $\mathfrak{C}_f$ is
also irreducible when considered as a representation of $\mathfrak{u}$, and
on letting $f$ assume the values 0, 1, 2, $\cdots$ they form a complete
system of inequivalent irreducible representations of $\mathfrak{u}$. The proof
of these assertions, which we employ heuristically in the follow- 
ing, will be given in Chapter V. On writing a homogeneous 
polynomial of order / in the variables x, y in the form 

the coefficients transform under the influence of a unitary
transformation $s$ like the components of a vector in the
representation space of $\mathfrak{C}_f$.

The complete reduction

$$\mathfrak{C}_f \times \mathfrak{C}_g = \mathfrak{C}_{f+g} + (\mathfrak{C}_{f-1} \times \mathfrak{C}_{g-1})$$

was accomplished by breaking up the space of the “polynomials
$\Phi$” into two invariant sub-spaces $(\Phi)_0$ and $(\Phi)'$. We must
now verify that these two sub-spaces are mutually orthogonal
in the unitary sense. A general polynomial $\Phi$ may be written

$$\Phi = \sum_{n,\nu} a_{n\nu}\, x_n\, \xi_\nu$$

where the $x_n$ are given by (7.2) and the $\xi_\nu$ are the corresponding
monomials

$$\xi_\nu = \frac{\xi^{\iota} \eta^{\kappa}}{\sqrt{\iota!\,\kappa!}} \qquad (\iota + \kappa = g,\ \iota - \kappa = \nu).$$

Two such polynomials $\Phi$ with coefficients $a_{n\nu}$, $b_{n\nu}$ are orthogonal
if

$$\sum_{n,\nu} \bar{a}_{n\nu}\, b_{n\nu} = 0.$$

The polynomial $x_f \xi_g$, whose highest coefficient $a_{fg} = 1$ while
all others vanish, is to within a constant factor $x^f \xi^g$ and is
obviously perpendicular to all polynomials $(\Phi)'$, for in all these
latter the coefficient of $x^f \xi^g$ vanishes. But under the unitary
transformation

$$s : \quad x' = \alpha x + \beta y, \qquad y' = -\bar\beta x + \bar\alpha y, \qquad (7.3)$$

where $\alpha\bar\alpha + \beta\bar\beta = 1$, $x^f \xi^g$ goes into

$$(\alpha x + \beta y)^f (\alpha \xi + \beta \eta)^g. \qquad (7.4)$$

Since $(\Phi)'$ and the orthogonality of polynomials are both
invariant under the unitary transformation $s$, (7.4) is also
orthogonal to $(\Phi)'$ and, with the help of the definition (6.12) of $(\Phi)_0$,
it follows from this that all polynomials of $(\Phi)_0$ are
unitary-orthogonal to those of $(\Phi)'$.

(7.3) is the most general unimodular unitary transformation.
This is derived in the same way as the familiar formula for the
orthogonal transformations of two variables with unit
determinant in plane analytical geometry. On writing the coefficients

$$\alpha = \kappa + i\lambda, \qquad \beta = -\mu + i\nu \qquad (7.5)$$

in terms of their real and imaginary parts we see that each such
transformation is characterized by four real parameters $\kappa$, $\lambda$, $\mu$, $\nu$,
the sum of whose squares is 1. The composition of two
transformations $s : (\kappa, \lambda, \mu, \nu)$ is accomplished in terms of these
parameters by Hamilton's quaternion multiplication; this latter
led to the vector calculus.
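
That the four real parameters behave like a unit quaternion under composition can be illustrated without writing out the multiplication table. The Python sketch below is an assumption-laden illustration: the sign convention chosen for $\beta$, the sample parameter values and the function names are all hypothetical. It checks that the product of two matrices of the form (7.3) is again of that form, with parameters whose squares sum to 1.

```python
import math

def su2(kappa, lam, mu, nu):
    """Matrix of the form (7.3) with alpha = kappa + i lam, beta = -mu + i nu.
    The sign convention for beta is an assumption of this sketch."""
    alpha, beta = complex(kappa, lam), complex(-mu, nu)
    return ((alpha, beta), (-beta.conjugate(), alpha.conjugate()))

def compose(a, b):
    """Matrix product a.b of two 2x2 matrices."""
    return tuple(tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

def params(m):
    """Recover (kappa, lam, mu, nu) from the first row of the matrix."""
    alpha, beta = m[0]
    return (alpha.real, alpha.imag, -beta.real, beta.imag)

def unit(q):
    """Scale four real parameters so that their squares sum to 1."""
    r = math.sqrt(sum(c * c for c in q))
    return tuple(c / r for c in q)

s = su2(*unit((1.0, 2.0, -0.5, 0.3)))
t = su2(*unit((0.2, -1.0, 0.7, 1.5)))
st = compose(s, t)

# The product has the same shape: second row = (-conj(beta), conj(alpha)).
same_form = (abs(st[1][0] + st[0][1].conjugate()) < 1e-12
             and abs(st[1][1] - st[0][0].conjugate()) < 1e-12)
norm = sum(c * c for c in params(st))
print(same_form, norm)
```

The parameters of the product are bilinear in those of the factors, which is the sense in which the composition is a quaternion multiplication.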
(2) Unitary Groups in n-Dimensions

The totality of tensors of order $f$ is the substratum of an
$n^f$-dimensional unitary representation $(\mathfrak{u})^f$ of the group $\mathfrak{u} = \mathfrak{u}_n$,
for on denoting the components of an arbitrary tensor by
$F(i_1 i_2 \cdots i_f)$ the sum

$$\sum_{(i_1, \cdots, i_f)} |F(i_1 i_2 \cdots i_f)|^2 \qquad (7.6)$$

is a unitary invariant. On restricting ourselves to the
$\binom{n}{f}$-dimensional linear manifold of anti-symmetric tensors we
take as the variables in tensor space those components
$F(i_1 i_2 \cdots i_f)$ for which $i_1 < i_2 < \cdots < i_f$. The sum (7.6)
for these components only is, however, equal to the complete
sum (7.6) divided by $f!$; hence the representation $\{\mathfrak{u}\}^f$ of $\mathfrak{u}$,
whose substratum consists of all anti-symmetric tensors, is
unitary. The situation is somewhat different for symmetric
tensors. The most general symmetric tensor of order $f$
transforms like $\mathfrak{x} \times \mathfrak{x} \times \cdots \times \mathfrak{x}$ ($f$ terms), i.e. we may for the
present purpose set

$$F(i_1 i_2 \cdots i_f) = x_{i_1} x_{i_2} \cdots x_{i_f}. \qquad (7.7)$$

We write the monomial on the right in the form

$$x_1^{f_1} x_2^{f_2} \cdots x_n^{f_n} \qquad (5.1)$$

as before; $f_r$ is the number of times the index $r$ appears in the
series $i_1, i_2, \cdots, i_f$. In this sense we write the components of
a symmetrical tensor

$$F(i_1 i_2 \cdots i_f) = \varphi(f_1, f_2, \cdots, f_n).$$

The sum (7.6) becomes in this case

$$\sum \frac{f!}{f_1!\, f_2! \cdots f_n!}\, |\varphi(f_1, f_2, \cdots, f_n)|^2,$$

extended over all integral $f_r \geq 0$ for which $f_1 + f_2 + \cdots + f_n = f$.
The coefficient indicates how often the term $|F(i_1 i_2 \cdots i_f)|^2$
occurs in the sum in consequence of the fact that its value is
unchanged on permuting the indices. We must therefore
consider the quantities

$$\frac{\varphi(f_1, f_2, \cdots, f_n)}{\sqrt{f_1!\, f_2! \cdots f_n!}}$$

as independent components of an arbitrary symmetric tensor
of order $f$ in order to obtain a unitary representation $[\mathfrak{u}]^f$.




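The truth of this assertion follows from the fact that the special
tensor (7.7) satisfies the equation

$$\frac{1}{f!}\,(x_1 \bar{x}_1 + \cdots + x_n \bar{x}_n)^f = \sum \frac{x_1^{f_1} \cdots x_n^{f_n} \cdot \bar{x}_1^{f_1} \cdots \bar{x}_n^{f_n}}{f_1! \cdots f_n!}. \qquad (7.8)$$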
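
Both the counting coefficient $f!/(f_1! \cdots f_n!)$ and the identity (7.8) can be checked by direct enumeration. The following Python sketch is an illustration only, with hypothetical function names and sample values for $n = 3$, $f = 3$.

```python
import math
from collections import Counter
from itertools import product

def occupation_numbers(f, n):
    """All (f_1, ..., f_n) of non-negative integers with f_1 + ... + f_n = f."""
    return [c for c in product(range(f + 1), repeat=n) if sum(c) == f]

def multiplicity(c):
    """f! / (f_1! ... f_n!): how many index sequences share the numbers c."""
    return math.factorial(sum(c)) // math.prod(math.factorial(e) for e in c)

def count_by_enumeration(f, n):
    """Count the index sequences (i_1, ..., i_f) by their occupation numbers."""
    counts = Counter()
    for indices in product(range(n), repeat=f):
        counts[tuple(indices.count(r) for r in range(n))] += 1
    return counts

def left_side(x, f):
    """(1/f!) (x_1 conj(x_1) + ... + x_n conj(x_n))^f."""
    return sum(abs(v) ** 2 for v in x) ** f / math.factorial(f)

def right_side(x, f):
    """Sum over occupation numbers of |x_1^f_1 ... x_n^f_n|^2 / (f_1! ... f_n!)."""
    return sum(math.prod(abs(v) ** (2 * e) for v, e in zip(x, c))
               / math.prod(math.factorial(e) for e in c)
               for c in occupation_numbers(f, len(x)))

x = (0.5 + 1.0j, -0.3j, 1.2)   # sample values
f = 3
counts = count_by_enumeration(f, len(x))
print(left_side(x, f), right_side(x, f))
```

The enumeration reproduces the multinomial coefficient for every set of occupation numbers, and the two sides of (7.8) agree to rounding error.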


We have already seen in I, § 5 that a normal co-ordinate
system can be so chosen that a commutative system $\Sigma$ of
unitary correspondences is completely reduced to a set of
1-dimensional systems. The only irreducible unitary
representations of an Abelian group are accordingly 1-dimensional.
For it follows from

$$U(s)\,U(t) = U(st) \qquad (4.1)$$

and the Abelian character of the group that the unitary matrices
$U(s)$ associated with the elements $s$ are commutative.

If $\mathfrak{H}$ and $\mathfrak{H}'$ are unitary representations, then $\mathfrak{H} + \mathfrak{H}'$ and
$\mathfrak{H} \times \mathfrak{H}'$ are also. The first fundamental problem for a given
group $\mathfrak{g}$ is to find a complete system of inequivalent irreducible
unitary representations of $\mathfrak{g}$, for then any unitary
representation of $\mathfrak{g}$ can be obtained by the addition of these irreducible
representations. The second fundamental problem is to reduce
the product $\mathfrak{H} \times \mathfrak{H}'$ of two irreducible representations of $\mathfrak{g}$
into its irreducible components; or better (after having solved the
first problem), to determine how often each of the irreducible
representations occurs in this product.

We illustrate these problems on the example offered by 
rotation groups, which are of particular importance in quantum 
physics. 


§ 8. Rotation and Lorentz Groups 

(a) The Group of Rotations in the Plane

We describe the 2-dimensional plane by a complex
co-ordinate $x$. The rotations of the plane are then given by

$$x \to x' = \varepsilon x, \qquad (8.1)$$

where $\varepsilon = e^{i\varphi}$ is a constant with unit modulus. (The rotations
of the real 2-dimensional plane thus coincide with the unitary
transformations of a single complex variable.) The angle of
rotation $\varphi$ determines the rotation completely, but it is of course
only determined mod. $2\pi$ by the rotation. The angle of rotation
behaves additively on composition: the rotation $\varphi$ followed by
the rotation $\varphi'$ results in the rotation $\varphi + \varphi'$. This rotation
group is accordingly a one-parameter continuous Abelian group.
We obtain a 1-dimensional representation $\mathfrak{D}^{(m)}$ of our rotation
group $\mathfrak{d} = \mathfrak{d}_2$ by associating with the element $\varepsilon$, (8.1), the linear
correspondence

$$x \to x' = \varepsilon^{-m} x = e^{-im\varphi}\, x, \qquad (8.2)$$

where $m$ is any fixed integer. I assert that the $\mathfrak{D}^{(m)}$, $m$ running
through all integral values, constitute a complete system of
irreducible unitary representations of $\mathfrak{d}_2$. This can be seen as
follows.

Any irreducible representation is necessarily 1-dimensional:
it associates with the rotation $\varphi$ a number $\chi(\varphi)$ of absolute value
1 such that

$$\chi(\varphi + \varphi') = \chi(\varphi) \cdot \chi(\varphi').$$

We assume that our representation is continuous; then $\chi(\varphi)$
is a continuous function of $\varphi$ with period $2\pi$. First, $\chi(0) = 1$.
We write $\chi(\varphi) = e^{i\lambda(\varphi)}$ and determine $\lambda(\varphi)$ uniquely by the
requirements that $\lambda(0) = 0$ and that $\lambda(\varphi)$ shall be a continuous function
of $\varphi$. We then have

$$\lambda(\varphi + \varphi') = \lambda(\varphi) + \lambda(\varphi'), \qquad (8.3)$$

for the right- and left-hand sides of this equation could at most
differ by an integral multiple of $2\pi$, but as it is written both
sides agree for $\varphi' = 0$ and vary continuously with $\varphi'$. (8.3)
satisfies the condition $\lambda(0) = 0$ and we obtain from it the further
equations

$$\lambda(-\varphi) = -\lambda(\varphi), \qquad \lambda(h\varphi) = h\,\lambda(\varphi), \qquad (8.4)$$

where $h$ is any integer. On replacing $\varphi$ in the second of these
equations by $\varphi/h$ we obtain

$$\lambda\!\left(\frac{\varphi}{h}\right) = \frac{1}{h}\,\lambda(\varphi). \qquad (8.5)$$

It follows immediately from (8.4), (8.5) that for every rational
number $k/h$ ($k$, $h$ integers)

$$\lambda\!\left(\frac{k}{h}\,\varphi\right) = \frac{k}{h}\,\lambda(\varphi). \qquad (8.6)$$

In accordance with our assumptions $\lambda(2\pi)$ is an integral multiple
$2m\pi$ of $2\pi$. On setting $\varphi = 2\pi$ in (8.6) we obtain the equation
$\lambda(\varphi) = m\varphi$ for all $\varphi$ which are rational fractions of $2\pi$; the
continuity requirement then allows us to assert its validity
for all real values of the argument $\varphi$.

The simple equation

$$\mathfrak{D}^{(m)} \times \mathfrak{D}^{(m')} = \mathfrak{D}^{(m + m')}$$

is here valid.
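
The three properties used in this argument (the functional equation, single-valuedness mod. $2\pi$, and the product rule) are easy to try out numerically. The Python sketch below is an illustration with hypothetical names; it takes the candidate $\varphi \to e^{im\varphi}$, and the sign convention chosen for $m$ is immaterial since $m$ runs through all integers.

```python
import cmath
import math

def chi(m, phi):
    """Candidate 1-dimensional representation: rotation phi -> e^{i m phi}."""
    return cmath.exp(1j * m * phi)

def is_multiplicative(m, phi, phi2, tol=1e-12):
    """chi(phi + phi') = chi(phi) chi(phi')."""
    return abs(chi(m, phi + phi2) - chi(m, phi) * chi(m, phi2)) < tol

def is_single_valued(m, tol=1e-12):
    """phi and phi + 2 pi describe the same rotation, so chi must agree on them."""
    return abs(chi(m, 2 * math.pi) - 1) < tol

def product_has_index_sum(m, m2, phi, tol=1e-12):
    """The product of the representations with indices m, m2 has index m + m2."""
    return abs(chi(m, phi) * chi(m2, phi) - chi(m + m2, phi)) < tol

print(is_multiplicative(3, 0.4, 1.9),
      is_single_valued(-2),
      is_single_valued(0.5),
      product_has_index_sum(2, -5, 0.8))
```

A non-integral index such as 0.5 still satisfies the functional equation but fails the periodicity condition, which is where the integrality of $m$ enters.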

Consider the function $f(p)$ on the unit circle in the complex
$x$ plane. If the point $p$ goes over into the point $p'$ under the
rotation $\varepsilon$, the function $f$ goes into a function $f'$ which is defined
by the equation

$$f'(p') = f(p).$$

The transition $f \to f'$ is a linear correspondence in the
$\infty$-dimensional space of functions $f(p)$ and is associated with the rotation
$\varepsilon$; this obviously defines an $\infty$-dimensional representation of
the rotation group $\mathfrak{d}_2$, which we denote by $\mathfrak{C}$. $\mathfrak{C}$ is unitary if
we take as the square of the absolute value of a “vector” $f$
the integral of $|f(p)|^2$ with respect to the element of arc $dp$ on
the unit circle. The fact that any function (satisfying suitable
conditions) on the unit circle can be developed in a Fourier
series means that in the reduction of $\mathfrak{C}$ into its irreducible
components each of the 1-dimensional representations $\mathfrak{D}^{(m)}$ occurs
once and only once. More precisely, this reduction is to be
interpreted with regard to the completeness relation.
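
The reduction by Fourier series can be seen in a discrete model. In the Python sketch below (an illustration only; the sampling on 16 equally spaced points, the test function and the names are assumptions) a rotation through the angle $\varphi$ multiplies the $m$-th Fourier coefficient by the factor $e^{-im\varphi}$, so that each index $m$ carries its own 1-dimensional representation.

```python
import cmath
import math

N = 16  # number of sample points on the unit circle (an arbitrary choice)

def sample(f):
    """Values of f at the N equally spaced angles."""
    return [f(2 * math.pi * j / N) for j in range(N)]

def coefficient(values, m):
    """m-th Fourier coefficient of the sampled function."""
    return sum(v * cmath.exp(-1j * m * 2 * math.pi * j / N)
               for j, v in enumerate(values)) / N

def rotate(values, k):
    """Rotation through 2 pi k / N: the new function takes at the rotated
    point p' the value which the old one took at p."""
    return [values[(j - k) % N] for j in range(N)]

values = sample(lambda t: 1 + 2 * math.cos(t) + math.sin(3 * t))
k = 5
phi = 2 * math.pi * k / N
ratios_ok = all(
    abs(coefficient(rotate(values, k), m)
        - cmath.exp(-1j * m * phi) * coefficient(values, m)) < 1e-12
    for m in range(-3, 4)
)
print(ratios_ok)
```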

(b) The Group of Rotations in 3-dimensional Space

We consider the functions $f = f(P)$ on the unit sphere as
the vectors of an $\infty$-dimensional unitary space whose metric
is given by $\int |f(P)|^2\, d\omega$; $d\omega$ is the surface element of the sphere
over which the integration is to be extended. If the point $P$
goes over into $P' = sP$ under the rotation $s$, the function $f$
goes over into the function $f'$ defined by $f'(P') = f(P)$. The
surface harmonics $Y_l$ of degree $l$ [cf. II, § 4] obviously span a
$(2l + 1)$-dimensional sub-space $\mathfrak{R}_l$ which is invariant under the
totality of transitions $f \to f'$ induced in function space by the
various elements $s$ of the rotation group $\mathfrak{d} = \mathfrak{d}_3$ (here again we
speak of this representation as $\mathfrak{C}$). They are consequently the
substratum of a certain representation $\mathfrak{D}_l$ of $\mathfrak{d}$ which is induced
in $\mathfrak{R}_l$ by $\mathfrak{d}$. On choosing a definite direction as that of the
$z$-axis we may, as in II, § 4, take the set

$$Y_l^{(m)} \qquad (m = l,\ l - 1,\ \cdots,\ -l)$$

as a basis for the surface harmonics of degree $l$. We then have
a unitary representation, and the sub-spaces $\mathfrak{R}_l$ corresponding
to the various values 0, 1, 2, $\cdots$ of $l$ are mutually perpendicular
in the unitary sense (orthogonality properties of surface
harmonics). $\mathfrak{d}$ contains the 2-dimensional rotation group $\mathfrak{d}_2$, e.g.
as the sub-group of rotations about the $z$-axis. The structure of
$Y_l^{(m)}$ shows that on restricting $\mathfrak{d}_3$ to this sub-group $\mathfrak{d}_2$ the
representation $\mathfrak{D}_l$ is reduced into the 1-dimensional
representations $\mathfrak{D}^{(m)}$ for which $m = l,\ l - 1,\ \cdots,\ -l$. The fact that
any function on the unit sphere possesses a unique expansion
in terms of surface harmonics means that on reducing $\mathfrak{C}$ into
its irreducible components each of the representations $\mathfrak{D}_l$, $l = 0$,
1, 2, $\cdots$, occurs exactly once. This reveals the true
significance of surface harmonics; they are characterized by the
fundamental symmetry properties here developed, and the
solution of the potential equation in polar co-ordinates is merely
an accidental approach to their theory.

Rotations are orthogonal transformations of three variables
$x$, $y$, $z$. If we wish to include with the proper rotations with
determinant $+1$ also the improper ones with determinant $-1$
(“augmented rotation group $\mathfrak{d}'$”) this can be done by
introducing the reflection

$$i : \quad x' = -x, \qquad y' = -y, \qquad z' = -z \qquad (8.7)$$

in the origin. Its reiteration $ii$ is the identity, and it commutes
with all rotations. The matrix corresponding to it in the
representation defined by the surface harmonics of degree $l$ is
the $(2l + 1)$-dimensional matrix $(-1)^l$, for the surface harmonics
of degree $l$ are homogeneous polynomials of degree $l$ in $x$, $y$, $z$.
We can thus obtain two representations $\mathfrak{D}_l^{+}$, $\mathfrak{D}_l^{-}$ of the
augmented rotation group from the representation $\mathfrak{D}_l$ of proper
rotations; these two coincide with $\mathfrak{D}_l$ for proper rotations,
but in the first the matrix associated with the reflection $i$ is $+1$
whereas in the second it is $-1$. We call this $\pm 1$ the signature
of the representation. Hence in the $\infty$-dimensional
representation $\mathfrak{C}$ of the augmented group $\mathfrak{d}'$ each $\mathfrak{D}_l$ occurs once
with signature $(-1)^l$, but not with the opposite signature.
Although we are not as yet in a position to prove it, the
$\mathfrak{D}_l$ ($l = 0, 1, 2, \cdots$) constitute a complete system of
inequivalent irreducible (single-valued) representations of the
rotation group $\mathfrak{d}$, and the $\mathfrak{D}_l^{+}$, $\mathfrak{D}_l^{-}$ together constitute such a
system for the augmented rotation group $\mathfrak{d}'$.

Now consider the unitary function space of all functions
$f(P)$ in 3-dimensional space for which the integral of $|f|^2$ over all
space is finite. Let the representation induced in this space
by rotations, in which the transition from $f$ to the transformed
function $f' = sf$ is associated with $s$, be denoted by $\mathfrak{E}$. Each
function $f(P)$ can be expanded in a series of terms of the form
$\varphi(r) \cdot Y_l$. Choose a complete orthogonal system $\varphi_1(r), \varphi_2(r), \cdots$
in the domain of functions $\varphi(r)$ of the radius $r$, in the sense of
the equations

$$\int_0^\infty r^2\, \bar\varphi_m(r)\, \varphi_n(r)\, dr = \delta_{mn}.$$

The functions of the form $\varphi_n(r) \cdot Y_l$ then constitute a
$(2l + 1)$-dimensional sub-space $\mathfrak{R}_{nl}$ which is invariant under rotations
and in which $\mathfrak{E}$ induces the representation $\mathfrak{D}_l$. Different $\mathfrak{R}_{nl}$
are mutually unitary-orthogonal. Each $\mathfrak{D}_l$ then appears in
$\mathfrak{E}$ infinitely often, its various occurrences being distinguished by
the “radial quantum number” $n$. Consider the analysis of
single electron spectra given in Chap. II, § 5, in the light of these
mathematical developments. We then see that the azimuthal
quantum number $l$ is of purely group-theoretic significance,
whereas the radial quantum number $n$ refers to the dynamical
situation, for the manner in which the orthogonal system $\varphi_n(r)$
is to be chosen is determined by the dynamical differential
equation.

The proper rotations of 3-dimensional Euclidean space about
the origin of Cartesian co-ordinates $x$, $y$, $z$, i.e. the real
orthogonal transformations with determinant $+1$, are most easily
represented by a stereographic projection of the unit sphere
about the origin on to the equatorial plane $z = 0$, the south pole
of the sphere being the centre of projection. If the point
$(x', y', 0)$ be the image on the plane of the point $(x, y, z)$ on the
sphere and we write $\zeta = x' + iy'$, the formulae for the projection are

$$x + iy = \frac{2\zeta}{1 + \zeta\bar\zeta}, \qquad x - iy = \frac{2\bar\zeta}{1 + \zeta\bar\zeta}, \qquad z = \frac{1 - \zeta\bar\zeta}{1 + \zeta\bar\zeta}.$$

But it is preferable to introduce the two homogeneous complex
co-ordinates $\xi$, $\eta$ in place of $\zeta$ by means of the equation $\zeta = \eta/\xi$;
the south pole $\xi : \eta = 0 : 1$ is then included. We then have

$$x + iy : x - iy : z : 1 = 2\bar\xi\eta : 2\xi\bar\eta : \bar\xi\xi - \bar\eta\eta : \bar\xi\xi + \bar\eta\eta.$$

Accordingly each unitary transformation

$$\xi' = \alpha\xi + \beta\eta, \qquad \eta' = \gamma\xi + \delta\eta$$

of the co-ordinates $\xi$, $\eta$ corresponds to a rotation $s$ of the sphere,
the points of which are represented by the rays $\xi : \eta$ of
2-dimensional unitary space. Since, as is readily seen, any point and
tangential direction through it on the sphere can be carried
over into any other such configuration on the sphere by means
of such rotations, we obtain in this way all rotations. Since
we are only concerned with the ratios of the coefficients $\alpha$, $\beta$,
$\gamma$, $\delta$, the arbitrary factor of proportionality may be chosen in
such a way that the determinant of the transformation is 1.
Nevertheless this normalization is somewhat artificial as the
correspondence is still double-valued, for on multiplying the
coefficients of the unitary transformation by $-1$, i.e. on going
over from $\sigma$ to $-\sigma$, the normalization is unaffected. Hence to
each element $\sigma$, (7.3), of the unimodular unitary group $\mathfrak{u}$
corresponds a rotation $s : \sigma \to s$ under which the co-ordinates
$x + iy$, $x - iy$, $z$ transform like

$$2\bar\xi\eta, \qquad 2\xi\bar\eta, \qquad \bar\xi\xi - \bar\eta\eta \qquad (8.8)$$

or

$$x \sim \bar\xi\eta + \xi\bar\eta, \qquad y \sim \frac{1}{i}(\bar\xi\eta - \xi\bar\eta), \qquad z \sim \bar\xi\xi - \bar\eta\eta. \qquad (8.9)$$

(The symbol $\sim$, which we occasionally employ, means that the
expression on the left transforms like the one on the right.)
We obtain in this way all rotations, each one exactly twice.
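
The correspondence between unimodular unitary transformations and rotations can be tested numerically. The Python sketch below is an illustration under stated assumptions: it takes the point of the sphere belonging to the ray $\xi : \eta$ from the proportion $x + iy : z : 1 = 2\bar\xi\eta : \bar\xi\xi - \bar\eta\eta : \bar\xi\xi + \bar\eta\eta$, uses a transformation with rows $(\alpha, \beta)$, $(-\bar\beta, \bar\alpha)$, and all names and sample values are hypothetical.

```python
import cmath
import math

def sphere_point(xi, eta):
    """Point (x, y, z) of the unit sphere represented by the ray xi : eta."""
    n = abs(xi) ** 2 + abs(eta) ** 2
    w = 2 * xi.conjugate() * eta / n                 # x + iy
    return (w.real, w.imag, (abs(xi) ** 2 - abs(eta) ** 2) / n)

def transform(alpha, beta, xi, eta):
    """Unimodular unitary transformation with rows (alpha, beta),
    (-conj(beta), conj(alpha)); assumes |alpha|^2 + |beta|^2 = 1."""
    return alpha * xi + beta * eta, -beta.conjugate() * xi + alpha.conjugate() * eta

def dot(p, q):
    return sum(a * b for a, b in zip(p, q))

alpha = math.cos(0.9) * cmath.exp(0.4j)
beta = math.sin(0.9) * cmath.exp(-1.3j)
ray1 = (1.0 + 2.0j, 0.5 - 1.0j)
ray2 = (-0.3 + 0.1j, 2.0 + 0.0j)

p, q = sphere_point(*ray1), sphere_point(*ray2)
p2 = sphere_point(*transform(alpha, beta, *ray1))
q2 = sphere_point(*transform(alpha, beta, *ray2))
p2_neg = sphere_point(*transform(-alpha, -beta, *ray1))

print(dot(p, p), dot(p, q), dot(p2, q2))
```

The images lie on the unit sphere and scalar products between points are preserved, as for a rotation; reversing the sign of all coefficients gives the same image point, which is the double-valuedness discussed above.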
The rotations about the $z$-axis are obtained from the “principal
transformations”

$$\xi' = \varepsilon\xi, \qquad \eta' = \frac{1}{\varepsilon}\,\eta$$

of $\mathfrak{u}$. In fact, on setting $\varepsilon = e(\omega)$ the angle of rotation
about the $z$-axis is $\varphi = -2\omega$. In virtue of the correspondence
$\sigma \to s$ the rotations in 3-dimensions constitute a representation
of the group $\mathfrak{u}$; and, conversely, the association $s \to \sigma$ is a
representation of the group $\mathfrak{d} = \mathfrak{d}_3$ of 3-dimensional rotations
by $\mathfrak{u}$, although this representation is double-valued. In virtue
of this correspondence $s \to \sigma$ any representation $U(\sigma)$ of $\mathfrak{u}$ yields
a representation of $\mathfrak{d}_3$ (“F process,” § 5); $\mathfrak{C}_f$ may thus be thought
of as a representation of $\mathfrak{d}_3$, in which case we write it $\mathfrak{D}_j$ where
$j = f/2$. The $\mathfrak{D}_j$ with integral $j$ (“even” $f$) are single-valued,
those with half-integral (i.e. half an odd integer) $j$ are
double-valued. On restricting the group $\mathfrak{d}_3$ to the sub-group $\mathfrak{d}_2$ of
rotations about the $z$-axis $\mathfrak{D}_j$ is reduced into the $2j + 1$
one-dimensional representations $\mathfrak{D}^{(m)}$ ($m = j,\ j - 1,\ \cdots,\ -j$). To
show this we first note that the substratum of our representation
consists of the monomials (7.2)

$$x(m) = \frac{\xi^i \eta^k}{\sqrt{i!\,k!}} \qquad (i + k = 2j,\ i - k = 2m),$$

where $m$ runs through the values $j,\ j - 1,\ \cdots,\ -j$. The
transformation induced on these variables by a rotation $\varphi$
about the $z$-axis is accordingly

$$x(m) \to e(-m\varphi) \cdot x(m).$$
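
The factor acquired by each monomial can be verified directly. The Python sketch below is an illustration and rests on assumptions: it reads $e(t)$ as $e^{it}$, takes the principal transformation as $\xi \to \varepsilon\xi$, $\eta \to \eta/\varepsilon$ with $\varepsilon = e^{i\omega}$ and rotation angle $\varphi = -2\omega$, and uses hypothetical names and the sample value $j = 3/2$.

```python
import cmath
import math

def monomial(xi, eta, j, m):
    """x(m) = xi^i eta^k / sqrt(i! k!) with i + k = 2j, i - k = 2m
    (j and m both integral or both half-integral)."""
    i, k = round(j + m), round(j - m)
    return xi ** i * eta ** k / math.sqrt(math.factorial(i) * math.factorial(k))

def principal(omega, xi, eta):
    """Principal transformation xi -> eps xi, eta -> eta / eps, eps = e^{i omega}."""
    eps = cmath.exp(1j * omega)
    return eps * xi, eta / eps

j = 1.5
omega = 0.35
phi = -2 * omega          # angle of the corresponding rotation about the z-axis
xi, eta = 0.8 - 0.3j, 0.4 + 1.1j

factors_ok = all(
    abs(monomial(*principal(omega, xi, eta), j, m)
        - cmath.exp(-1j * m * phi) * monomial(xi, eta, j, m)) < 1e-12
    for m in (1.5, 0.5, -0.5, -1.5)
)
print(factors_ok)
```

For half-integral $j$ the factor changes sign when $\varphi$ is increased by $2\pi$, which is the double-valuedness of these representations.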

The representation $\sigma \to s$ of $\mathfrak{u}$ is itself contained among the
representations $\mathfrak{D}_j$ of $\mathfrak{u}$ constructed above; it is, in fact, $\mathfrak{D}_1$.
To show this we note that if $(\xi, \eta)$, $(\xi', \eta')$ be subjected to the
same transformation $\sigma$ of $\mathfrak{u}$, then the determinant $\xi\eta' - \eta\xi'$, as
well as $\bar\xi\xi + \bar\eta\eta$, is invariant. Consequently $(\bar\xi, \bar\eta)$ transform
cogrediently to $(\eta', -\xi')$, or as $(\eta, -\xi)$; hence

$$x + iy \sim \eta^2, \qquad x - iy \sim -\xi^2, \qquad z \sim \xi\eta. \qquad (8.10)$$

The representations $\mathfrak{D}_j$ with integral $j$ are identical with those
obtained above as the representations induced on surface
harmonics of order $j$, for each polynomial in $x$, $y$, $z$ of degree $j$ is,
in virtue of (8.10), equivalent to a form of order $2j$ in $\xi$, $\eta$.

If we wish to augment $\mathfrak{u} = \mathfrak{u}_2$ in a manner paralleling the
augmentation of $\mathfrak{d} = \mathfrak{d}_3$ by the improper rotation $i$ (reflection
in the origin) we must consider it as an abstract group rather
than a group of linear transformations in two variables. Denote
the element corresponding to $i$ by $\iota$ and the elements of the
original $\mathfrak{u}$ by $\sigma$ as before. We define the augmented $\mathfrak{u}'$ as the
totality of elements of the types $\sigma$ and $\iota\sigma$; $\iota$ must naturally
obey the multiplication laws

$$\iota\sigma = \sigma\iota, \qquad \iota\iota = 1.$$

$\mathfrak{C}_f^{+}$ and $\mathfrak{C}_f^{-}$ are then those representations of $\mathfrak{u}'$ which coincide
with $\mathfrak{C}_f$ for elements of the restricted group $\mathfrak{u}$ and which
associate with the element $\iota$ the unit matrix $+1$ and its negative
$-1$, respectively. The sign $\pm$ is again called the signature.
The representation $\mathfrak{D}_1^{-}$ associates the augmented rotation group
$\mathfrak{d}'_3$ with $\mathfrak{u}'$.


(c) The Lorentz Group 

Let the 3-dimensional Euclidean space be referred to
homogeneous projective co-ordinates $x_\alpha$ ($\alpha = 0, 1, 2, 3$) defined by

$$x = \frac{x_1}{x_0}, \qquad y = \frac{x_2}{x_0}, \qquad z = \frac{x_3}{x_0}.$$

The equation of the unit sphere is then

$$-x_0^2 + x_1^2 + x_2^2 + x_3^2 = 0 \qquad (8.11)$$

and the formulae for the stereographic projection considered
above become

$$x_0 = \bar\xi\xi + \bar\eta\eta, \qquad x_1 = \bar\xi\eta + \bar\eta\xi, \qquad x_2 = \frac{1}{i}(\bar\xi\eta - \bar\eta\xi), \qquad x_3 = \bar\xi\xi - \bar\eta\eta. \qquad (8.12)$$

On subjecting $\xi$, $\eta$ to an arbitrary linear transformation $\sigma$ the
$x_\alpha$ undergo a corresponding real linear transformation $s$ which
leaves the equation (8.11) invariant. If the absolute value of
the determinant of $\sigma$ is 1, we can readily show that the form

$$-x_0^2 + x_1^2 + x_2^2 + x_3^2 \qquad (8.13)$$

is itself invariant under the corresponding $s$, and that the
determinant of $s$ is $+1$.

We now consider $x_0 = ct$, $x_1$, $x_2$, $x_3$ as the co-ordinates of
space-time; (8.11) is then the equation of the light-cone, the
generators of which are the possible paths for a beam of light.
In the restricted theory of relativity normal co-ordinate systems
for space-time are connected with each other by arbitrary
Lorentz transformations, i.e. by any real linear transformation
which leaves the form (8.13) invariant and which does not
interchange past and future. Lorentz transformations
constitute a group, the “complete Lorentz group,” and this group
describes the homogeneity of the 4-dimensional world. This
group consists of “positive” and “negative” transformations,
i.e. transformations with determinants $+1$ and $-1$, respectively.
The first constitute the “restricted Lorentz group,” from which
the complete group is obtained by introducing in addition the
spatial reflection

$$x_0 \to x_0, \qquad x_\alpha \to -x_\alpha \quad (\alpha = 1, 2, 3). \qquad (8.14)$$

Under the restricted group right and left, as well as past and
future, are fundamentally different. Since the expression for
$x_0$ in (8.12) is positive definite, we may state the result obtained
above in the form: any linear transformation $\sigma$ of $\xi$, $\eta$, with
determinant of absolute value 1, induces a positive Lorentz
transformation $s$ in the $x_\alpha$. Transformations $\sigma$ which differ only by a factor
$e^{i\lambda}$ of absolute value 1 give rise to the same $s$. The correspondence
$\sigma \to s$ is naturally a representation.
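
The invariance of the form (8.13) can be tried numerically. Since the quantities built from a single pair $\xi$, $\eta$ always lie on the light-cone, the Python sketch below tests the form on the sum of two such vectors; because the induced $s$ is linear, the image of the sum is the sum of the images. The sketch is an illustration only: the explicit real expressions used for $x_0, \cdots, x_3$, the sample transformation and all names are assumptions.

```python
import cmath

def light_vector(xi, eta):
    """Real quantities (x0, x1, x2, x3) formed from xi, eta:
    x0 = |xi|^2 + |eta|^2, x1 + i x2 = 2 conj(xi) eta, x3 = |xi|^2 - |eta|^2."""
    c = xi.conjugate() * eta
    return (abs(xi) ** 2 + abs(eta) ** 2, 2 * c.real, 2 * c.imag,
            abs(xi) ** 2 - abs(eta) ** 2)

def form(x):
    """The quadratic form -x0^2 + x1^2 + x2^2 + x3^2."""
    return -x[0] ** 2 + x[1] ** 2 + x[2] ** 2 + x[3] ** 2

def act(sigma, pair):
    """Apply the 2x2 matrix sigma to the pair (xi, eta)."""
    (a, b), (c, d) = sigma
    xi, eta = pair
    return a * xi + b * eta, c * xi + d * eta

def add(x, y):
    return tuple(u + v for u, v in zip(x, y))

# A hypothetical linear transformation, rescaled so that |det sigma| = 1.
raw = ((1 + 2j, 0.5 + 0j), (-1j, 2 - 1j))
det = raw[0][0] * raw[1][1] - raw[0][1] * raw[1][0]
phase = cmath.exp(0.3j)
sigma = tuple(tuple(phase * e / cmath.sqrt(det) for e in row) for row in raw)

pair1 = (0.7 + 0.2j, -1.1 + 0.4j)
pair2 = (0.3 - 0.9j, 1.5 + 0j)

before = form(add(light_vector(*pair1), light_vector(*pair2)))
after = form(add(light_vector(*act(sigma, pair1)), light_vector(*act(sigma, pair2))))
print(before, after)
```

The two printed values agree to rounding error, and the extra phase factor in `sigma` has no effect on them.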

The question of whether every positive Lorentz transformation
$s$ can be obtained in this way arises immediately. That this
is in fact the case can be seen from general continuity
considerations, for the positive Lorentz transformations constitute
a single connected continuum. But it is also easily proved by
elementary methods. Since we have seen in (b) above that the
rotations of space $s$ are obtained from the unitary
transformations $\sigma$, we need only to examine the Lorentz transformation

$$(x_0 + x_3) \to a^2 (x_0 + x_3), \qquad (x_0 - x_3) \to \frac{1}{a^2}(x_0 - x_3), \qquad x_1 \to x_1, \quad x_2 \to x_2,$$

affecting the time axis, where $a$ is a real non-vanishing constant.
But this transformation is obtained from the unimodular $\sigma$ :

$$\xi' = a\xi, \qquad \eta' = \frac{1}{a}\,\eta.$$

Returning to the general case, the correspondence $s \to \sigma$ is a
2-dimensional representation of the restricted Lorentz group.
But $\sigma$ is determined by $s$ only to within the arbitrary “gauge
factor” $e^{i\lambda}$; we may therefore normalize it by the condition
that the determinant of $\sigma$ shall itself be unity, not merely its
absolute value. Even so, $\sigma$ remains double-valued, for $-\sigma$
satisfies the normalizing condition as well as $\sigma$. This
representation $s \to \sigma$ contains the representation of the rotation
group considered in (b) on allowing $s$ to run through the
sub-group of spatial rotations contained in the restricted Lorentz
group.

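The expressions (8.12) are Hermitian forms with matrices

$$S_0 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad S_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad S_2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad S_3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.$$

Hence if $\mathfrak{x}$ denotes the one-columned matrix with elements $\xi$, $\eta$
and $\mathfrak{x}^{*}$ the row Hermitian conjugate to it, equations (8.12) may be written

$$x_\alpha = \mathfrak{x}^{*} S_\alpha\, \mathfrak{x}. \qquad (8.16)$$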
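
The matrix form of these expressions and the special transformation affecting the time axis can both be checked numerically. The Python sketch below is an illustration under stated assumptions: the four matrices are taken as the unit matrix and the three standard spin matrices, the explicit expressions are those assumed here for (8.12), the transformation $\xi \to a\xi$, $\eta \to \eta/a$ is the one assumed for the time axis, and all names and sample values are hypothetical.

```python
S = [
    ((1, 0), (0, 1)),        # S_0
    ((0, 1), (1, 0)),        # S_1
    ((0, -1j), (1j, 0)),     # S_2
    ((1, 0), (0, -1)),       # S_3
]

def hermitian_form(matrix, pair):
    """Value of the Hermitian form with the given matrix on the column (xi, eta)."""
    return sum(complex(pair[r]).conjugate() * matrix[r][c] * pair[c]
               for r in range(2) for c in range(2))

def x_of(pair):
    """The four quantities x_0, ..., x_3 as Hermitian forms."""
    return [hermitian_form(m, pair) for m in S]

def explicit(pair):
    """The same quantities written out in xi, eta."""
    xi, eta = pair
    return [xi.conjugate() * xi + eta.conjugate() * eta,
            xi.conjugate() * eta + eta.conjugate() * xi,
            (xi.conjugate() * eta - eta.conjugate() * xi) / 1j,
            xi.conjugate() * xi - eta.conjugate() * eta]

def stretch(a, pair):
    """xi -> a xi, eta -> eta / a for a real non-vanishing constant a."""
    xi, eta = pair
    return (a * xi, eta / a)

pair = (0.6 + 0.8j, -0.2 + 1.5j)
a = 1.7
x = x_of(pair)
y = x_of(stretch(a, pair))
print([round(v.real, 6) for v in x], [round(v.real, 6) for v in y])
```

The forms come out real, they agree with the explicit expressions, and the stretch multiplies $x_0 + x_3$ by $a^2$ and $x_0 - x_3$ by $1/a^2$ while leaving $x_1$, $x_2$ unchanged.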


On replacing rj by T], ~ | the x,^ undergo the spatial re- 
flection (8.14). That is one way of including the negative 
Lorentz transformations. But if we require that the corre- 
sponding transformation of rj be linear, we must introduce in 
addition to J = (^, r]) a second pair = (^', rj') which undergoes 
the transformation a contragredient to 3. Then 

(t), — ^)~(^', 7]') to within the factor d, 

('*?, “ l)~(l', v) within the factor d, 
where d is the determinant of cr. Defining 


S' = S 


the quantities 


(a =1,2, 3), 
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undergo the same transformation 5 as (8.16), provided the 
absolute value of the determinant of a is 1. The same is true 
for any linear combination of the two, e.g. x'^. Hence the 

quantities 

= (8.17) 

undergo the given positive Lorentz transformation .s when rj 
are subjected to a certain transformation a and simultaneously 
rj' to the transformation a contragredient to a. Furthermore^ 
they undergo the transformation (8.14) on interchanging the two 
pairs 5, j', i.e. on subjecting the four variables to the trans- 
form ^^tion 

T \ ^ V ] v' V' (8.18) 

The expression

$$\bar\xi'\xi + \bar\eta'\eta$$

is invariant in virtue of the transformation law of $\xi', \eta'$ defined
above. To obtain an expression which is also invariant under
the interchange (8.18) we must add to the above the expression
obtained from it by this interchange:

$$(\bar\xi'\xi + \bar\eta'\eta) + (\bar\xi\xi' + \bar\eta\eta'). \qquad (8.19)$$

It will be found advantageous to denote the column consisting
of the four elements $(\xi, \eta;\ \xi', \eta')$ by a single letter $\mathfrak{x}$.
Let that linear transformation of these four variables which
transforms $\xi, \eta$ in accordance with $S_\alpha$, and $\xi', \eta'$ in accordance
with $S'_\alpha$, be denoted simply by $S_\alpha$: (8.17) then becomes

$$x_\alpha = \bar{\mathfrak{x}}^{*} S_\alpha\, \mathfrak{x}. \qquad (8.16')$$

We must now ask to what extent the linear transformation $\sigma$
of the four variables $\mathfrak{x}$ is determined by the requirement that
it induce a given (positive or negative) Lorentz transformation
$s$ of the Hermitian forms $x_\alpha$. It suffices for this purpose to
inquire what transformations of the $\mathfrak{x}$ induce the identity on
the variables $x_\alpha$. The only transformations of this latter kind
are those which multiply $\xi, \eta$ with a common factor $e^{i\lambda}$ of absolute
value 1 and at the same time $\xi', \eta'$ with any factor $e^{i\lambda'}$
(independent of the first) of absolute value 1. But $\sigma$ can be more
precisely specified by the requirement that (8.19), i.e. $\bar{\mathfrak{x}}^{*} T \mathfrak{x}$, be
also invariant. The two arbitrary “gauge factors” $e^{i\lambda}$, $e^{i\lambda'}$
must then coincide: the substitution $\sigma$ is then determined to
within a factor $e^{i\lambda}$.

Our analysis reduces the problem of the representations of
the Lorentz group to the corresponding problem for the
unimodular linear group $\mathfrak{c}_2$.

§ 9. Character of a Representation

The trace of a linear correspondence, i.e. the sum of the
elements in the principal diagonal of the matrix, is an invariant
under transformations of co-ordinates which is of
particular importance. The trace $\chi(s)$ of the correspondence
$U(s)$ associated with the element $s$ of the group $\mathfrak{g}$ in a
representation $\mathfrak{H}$ of $\mathfrak{g}$ is called the group characteristic or, in
order to avoid assigning yet another meaning to this second
word, which has already appeared in another important connection
in quantum mechanics, simply the character of the
representation $\mathfrak{H}$. Equivalent representations have the same
character; the name is so chosen because the converse of this
theorem is true within wide limits. Since $U(\mathrm{I}) = 1$, the value
of the character $\chi(\mathrm{I})$ for the unit element is equal to the
dimensionality of the representation.

It follows from the equations

$$U(asa^{-1}) = U(a)U(s)U(a^{-1}) = U(a)U(s)U^{-1}(a)$$

that the matrices $U(s)$ and $U(asa^{-1})$ differ only in their orientation
and consequently have the same trace:

$$\chi(asa^{-1}) = \chi(s).$$

Now $s$ and $asa^{-1}$ are any two conjugate elements of the group $\mathfrak{g}$,
i.e. they belong to the same class of conjugates in the sense of
§ 3. We speak of a function $f(s)$ on the group manifold which
has the same value for all elements $s$ belonging to the same
class as a class function; such a function can at most allow us
to distinguish between different classes, but not between elements
of the same class. The distinguishing feature of class
functions can also be expressed in the equation

$$f(st) = f(ts).$$

The validity of this equation for $f = \chi$ follows from

$$U(st) = U(s)U(t), \qquad U(ts) = U(t)U(s)$$

and the fact that the trace of the matrix $AB$ is equal to the
trace of $BA$.
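The class-function property of a character can be checked on a small finite group. The sketch below assumes, purely as an illustration not taken from the text, the representation of the permutations of three objects by permutation matrices; all function names are hypothetical.

```python
from itertools import permutations

def perm_matrix(p):
    """Matrix U with U[i][j] = 1 exactly when the permutation p sends j to i."""
    n = len(p)
    return [[1 if p[j] == i else 0 for j in range(n)] for i in range(n)]

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

def compose(p, q):
    """The permutation 'p after q'; perm_matrix turns this into a matrix product."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

group = list(permutations(range(3)))
chi = {s: trace(perm_matrix(s)) for s in group}

# chi(st) = chi(ts) for every pair, and chi is constant on conjugate elements.
symmetric = all(chi[compose(s, t)] == chi[compose(t, s)]
                for s in group for t in group)
class_function = all(chi[compose(compose(a, s), inverse(a))] == chi[s]
                     for a in group for s in group)
```

Here the character simply counts the objects left fixed by a permutation, so it takes the value 3 on the unit element, in agreement with the dimensionality of the representation.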

The character $\chi(s)$ of a unitary representation, $U(s^{-1}) = \bar{U}^{*}(s)$,
satisfies the equation

$$\chi(s^{-1}) = \bar\chi(s). \qquad (9.1)$$

We shall say that the characters of irreducible representations
are primitive. Any unitary representation $\mathfrak{H}$ can be reduced
into its irreducible components, and the normal co-ordinate
system in the corresponding sub-spaces can be so chosen that
two irreducible constituents are equal if they are equivalent.
If in this sense

$$\mathfrak{H} = m\,\mathfrak{h} + m'\,\mathfrak{h}' + \cdots \qquad (9.2)$$

where $\mathfrak{h}, \mathfrak{h}', \cdots$ are inequivalent irreducible representations and
$m, m', \cdots$ are the numbers of times they occur in $\mathfrak{H}$, then the
character $X$ of $\mathfrak{H}$ is expressed in terms of the characters $\chi, \chi', \cdots$
of $\mathfrak{h}, \mathfrak{h}', \cdots$ by the equation

$$X(s) = m\chi(s) + m'\chi'(s) + \cdots. \qquad (9.3)$$

From an $n$-dimensional representation $\mathfrak{H}: s \to U(s)$, with
the character $\chi(s)$, and an $n'$-dimensional $\mathfrak{H}': s \to U'(s)$ of
character $\chi'(s)$ we can construct the $(nn')$-dimensional
representation $\mathfrak{H} \times \mathfrak{H}'$. The elements in the principal diagonal of
$U(s) \times U'(s)$ are obtained by multiplying all elements in the
principal diagonal of $U(s)$ by those in the principal diagonal
of $U'(s)$: the character of $\mathfrak{H} \times \mathfrak{H}'$ is consequently $\chi(s)\chi'(s)$.
If $\mathfrak{H}$ is a representation of the group $\mathfrak{g}$, $\mathfrak{H}'$ a representation of
the group $\mathfrak{g}'$, then the representation $\mathfrak{H} \times \mathfrak{H}'$ of $\mathfrak{g} \times \mathfrak{g}'$ has the
character $\zeta$ defined by

$$\zeta(s, s') = \chi(s)\chi'(s'), \qquad (9.4)$$

where $s$ runs through the elements of $\mathfrak{g}$ and $s'$ those of $\mathfrak{g}'$.
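That the trace of a product representation is the product of the traces can be seen directly on two arbitrary matrices. The following sketch uses made-up integer matrices and a hypothetical helper `kron` for the Kronecker product; it is an illustration only.

```python
def kron(A, B):
    """Kronecker product of two square matrices given as nested lists."""
    n, m = len(A), len(B)
    return [[A[i][k] * B[j][l] for k in range(n) for l in range(m)]
            for i in range(n) for j in range(m)]

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

# Two arbitrary example matrices (not taken from the text).
U = [[2, 1], [0, 3]]
V = [[1, 4, 0], [2, 5, 1], [0, 7, 6]]

K = kron(U, V)                      # a 6-rowed square matrix
product_of_traces = trace(U) * trace(V)
```

The diagonal of `K` consists of all products of one diagonal element of `U` with one diagonal element of `V`, which is the observation the text uses to obtain the character of the product.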

We need not distinguish between a 1-dimensional representation
and its character; the character satisfies the simple
equation (4.2). This holds, for example, for the characters
$e(m\varphi)$, eq. (8.2), of the rotation group $\mathfrak{d}_2$.

By the theorem on the transformation of unitary correspondences
to principal axes, each element of the group $\mathfrak{u} = \mathfrak{u}_2$ is
conjugate to a principal element, i.e. an element of the form

$$\begin{pmatrix} \varepsilon & 0 \\ 0 & 1/\varepsilon \end{pmatrix}. \qquad (9.5)$$

The characteristic values $\varepsilon, 1/\varepsilon$ are determined to within the
order in which they appear. Introducing the angle $\omega$ by the
equation $\varepsilon = e(\omega)$, $\omega$ characterizes a class of conjugate elements
of $\mathfrak{u}$; we are only concerned with $\omega$ mod. $2\pi$, and furthermore
the class $-\omega$ coincides with the class $\omega$. Since for any
representation of $\mathfrak{u}$ the character $\chi(s)$ depends only on the class
of the element $s$, it suffices to calculate it for elements of the
form (9.5). It must be a periodic function of the angle $\omega$ with
period $2\pi$, and it must furthermore be an even function of $\omega$;
its value for $\mathfrak{C}_f$ is

$$\chi_f = \varepsilon^{f} + \varepsilon^{f-2} + \cdots + \varepsilon^{-f}
= \frac{\varepsilon^{f+1} - \varepsilon^{-(f+1)}}{\varepsilon - \varepsilon^{-1}}. \qquad (9.6)$$

The characters of the representations considered in the
other examples of the preceding section are just as readily 
calculated. 
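The closed form of the character can be compared with the sum of powers numerically. The sketch below assumes that $\chi_f$ is the sum $\varepsilon^f + \varepsilon^{f-2} + \cdots + \varepsilon^{-f}$ with $\varepsilon = e^{i\omega}$, so that the quotient in (9.6) equals $\sin((f+1)\omega)/\sin\omega$; the function names are illustrative.

```python
import cmath
import math

def chi_sum(f, omega):
    """epsilon^f + epsilon^(f-2) + ... + epsilon^(-f) with epsilon = exp(i*omega)."""
    eps = cmath.exp(1j * omega)
    return sum(eps ** (f - 2 * k) for k in range(f + 1))

def chi_closed(f, omega):
    """Quotient form sin((f+1)*omega) / sin(omega); requires sin(omega) != 0."""
    return math.sin((f + 1) * omega) / math.sin(omega)

# Largest disagreement between the two forms on a few sample angles.
max_error = max(abs(chi_sum(f, w) - chi_closed(f, w))
                for f in range(6) for w in (0.3, 1.1, 2.5))
```

At $\omega = 0$ the sum reduces to $f + 1$, the dimensionality of the representation, and it is visibly unchanged when $\omega$ is replaced by $-\omega$.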

§ 10. Schur’s Lemma and Burnside's Theorem 

Lemma (10.1). Assumption. Let $\Sigma$ be an irreducible system
of linear correspondences of an $m$-dimensional vector space $\mathfrak{r}$
on to itself, and $Q$ such a system of an $n$-dimensional vector
space $\mathfrak{s}$. A linear correspondence $A$ shall satisfy the equation

$$\Sigma A = AQ \qquad (10.2)$$

in the following double sense: for each $U$ of $\Sigma$ there shall exist
a $V$ of $Q$ such that

$$UA = AV, \qquad (10.3)$$

and conversely for each $V$ of $Q$ there shall exist a $U$ of $\Sigma$ such
that this relation is fulfilled.

Assertion. Either $A = 0$ or $m = n$ and $\det A \neq 0$; in the
latter case $\Sigma$ and $Q$ are equivalent.

Proof. We first make use of the assumption that $\Sigma$ is
irreducible in connection with equation (10.2) in the first sense.
Considering the $k$-th column

$$(a_{1k}, a_{2k}, \cdots, a_{mk})$$

of $A$ as a vector $\mathfrak{a}_k$, equation (10.3) asserts that the vector $U\mathfrak{a}_k$
associated with $\mathfrak{a}_k$ through the correspondence $U$ is
a linear combination of the vectors $\mathfrak{a}_h$, specifically that

$$U\mathfrak{a}_k = \sum_h v_{hk}\,\mathfrak{a}_h, \qquad V = \|v_{hk}\|.$$

Consequently the sub-space of $\mathfrak{r}$ spanned by the $n$ vectors $\mathfrak{a}_k$
is invariant under $\Sigma$. But because of the assumption that $\Sigma$
is irreducible either all $\mathfrak{a}_k = 0$, i.e. $A = 0$, or the $\mathfrak{a}_k$ span the entire
space $\mathfrak{r}$, in which case $m$ of them are linearly independent;
this latter is possible only if $n \geq m$. That our conclusion
contains two possibilities is due to the fact that the concept
of irreducibility contains such an alternative.

The second part of the assumption can be given a simple
geometrical interpretation on going over to the transposed
matrices: $Q^{*}$ is irreducible and for each $V^{*}$ of $Q^{*}$ there exists
a $U^{*}$ of $\Sigma^{*}$ such that

$$V^{*}A^{*} = A^{*}U^{*}.$$

The reasoning employed in the first part of the theorem allows
us to conclude: either $A^{*} = 0$ or $m \geq n$. We summarize the
results thus far obtained in the statement: Either $A = 0$ or
$m = n$; in the latter case the $m = n$ columns of $A$ are
linearly independent, i.e. the determinant of $A$ does not vanish.
But then each of $U$ and $V$ is determined uniquely by the other
through the relation (10.3), $V = A^{-1}UA$, and $\Sigma$ and $Q$ are equivalent.

In formulating these results it is desirable to consider the
case of equivalence separately:

I. If the two irreducible systems $\Sigma$, $Q$ are inequivalent, (10.2)
can only be satisfied by $A = 0$.

II. If $\Sigma$ is an irreducible system a correspondence $A$ commutes
with all correspondences $U$ of the system $\Sigma$:

$$UA = AU \qquad (10.4)$$

if and only if $A$ is a multiple of the unit matrix 1.

Assertion II follows from the lemma proved above by
elementary methods and the fundamental theorem of algebra.
For by the latter there exists a number $\alpha$ such that
$\det(A - \alpha 1) = 0$, and since $A' = A - \alpha 1$ satisfies (10.4) for
all $U$ if $A$ does, we conclude that since $\det A' = 0$ we must
have $A' = 0$.

Applied to representations, our results are:

Fundamental Theorem (10.5). I. If $s \to U(s)$, $s \to V(s)$ are
two inequivalent irreducible representations of a group $\mathfrak{g}$, the
equation

$$U(s)A = AV(s)$$

can be satisfied by no matrix $A$ which is independent of $s$, except
$A = 0$.

II. A matrix $A$ which is independent of $s$ and which satisfies
the equation

$$U(s)A = AU(s)$$

for all $s$ is necessarily a multiple of the unit matrix 1.

If there exists a matrix $A$ which satisfies $U(s)A = AU(s)$
identically in $s$ and which is not merely a multiple of the unit
matrix 1, the argument employed above supplies us with a
constructive process for the reduction of the representation
$s \to U(s)$ with the aid of $A$.
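Part II of the theorem can be tested on a small example by solving the linear equations $UA = AU$ exactly. The sketch below assumes, as an illustration not taken from the text, the 2-dimensional irreducible representation of the permutation group of three objects, generated by two integer matrices `GEN_A` and `GEN_B`; all names are hypothetical.

```python
from fractions import Fraction

# Assumed generators of a 2-dimensional irreducible representation
# of the permutation group of three objects (illustration only).
GEN_A = [[0, -1], [1, -1]]   # an element of order 3
GEN_B = [[0, 1], [1, 0]]     # an element of order 2

def commutator_rows(U):
    """Rows of the system U*A - A*U = 0 in the unknowns a11, a12, a21, a22."""
    rows = []
    for i in range(2):
        for k in range(2):
            row = [Fraction(0)] * 4
            for j in range(2):
                row[2 * j + k] += U[i][j]   # (U*A)[i][k] = sum_j U[i][j] * a[j][k]
                row[2 * i + j] -= U[j][k]   # (A*U)[i][k] = sum_j a[i][j] * U[j][k]
            rows.append(row)
    return rows

def rank(rows):
    """Rank by Gaussian elimination over the rational numbers."""
    rows = [r[:] for r in rows]
    rk = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(rk, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for r in range(len(rows)):
            if r != rk and rows[r][col] != 0:
                factor = rows[r][col] / rows[rk][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[rk])]
        rk += 1
    return rk

# Matrices commuting with the whole (irreducible) representation.
irreducible_dimension = 4 - rank(commutator_rows(GEN_A) + commutator_rows(GEN_B))
# Matrices commuting with GEN_B alone, which generates a reducible system.
reducible_dimension = 4 - rank(commutator_rows(GEN_B))
```

A one-dimensional solution space means that only multiples of the unit matrix commute with the representation; for the reducible system generated by `GEN_B` alone the solution space is larger, and any non-scalar solution could serve as the matrix $A$ of the reduction process mentioned above.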

We now consider an application of these important results,
which are fundamental for the entire theory of representations,
in order to prove a theorem due to Burnside. Let $\Sigma$ be a
multiplicative system, i.e. if $U, U'$ are two correspondences in
$\Sigma$ then the product $UU'$ is also a correspondence in $\Sigma$. This
concept is somewhat wider than that of a group; we need not
require that $U$ possess an inverse, for its determinant may be 0.

Burnside's Theorem (10.6). In an irreducible multiplicative
system $\Sigma$ of linear correspondences $U = \|u_{ik}\|$ of an $n$-dimensional
vector space on to itself the components $u_{ik}$ are linearly independent.
This asserts that the only matrix $L = \|l_{ki}\|$ which satisfies the equation

$$\mathrm{tr}(UL) = \sum_{i,k} l_{ki}\,u_{ik} = 0$$

for all matrices $U$ of the system is $L = 0$. Contrary to the
assertion, we assume there exist non-vanishing matrices satisfying
this equation; such matrices we shall call L-matrices.
It is of course possible that every L-matrix whose first column
vanishes must itself vanish. But in any case we can find a
definite column index $h$ with the following properties: there
exist non-vanishing L-matrices whose first $h - 1$ columns
vanish and are such that if the $h$-th column also vanishes then
necessarily $L = 0$. We shall call L-matrices whose first $h - 1$
columns vanish special L-matrices. They constitute a linear
family of $m \leq n$ dimensions; we denote a basis for this family
by

$$L^{(1)}, L^{(2)}, \cdots, L^{(m)}.$$

The $h$-th column of a special L-matrix will be written $\mathfrak{l}$.

Since $\Sigma$ is multiplicative the equation

$$\mathrm{tr}(U'UL) = 0$$

is satisfied by each L-matrix, where $U, U'$ are arbitrary
correspondences of the system $\Sigma$. With $L$, $UL$ is also an L-matrix;
obviously it is a special L-matrix if $L$ is. Each of the matrices

$$UL^{(1)}, UL^{(2)}, \cdots, UL^{(m)}$$

is therefore a linear combination of $L^{(1)}, \cdots, L^{(m)}$ and each of
the vectors $U\mathfrak{l}^{(1)}, \cdots, U\mathfrak{l}^{(m)}$ is a linear combination of the
vectors $\mathfrak{l}^{(1)}, \cdots, \mathfrak{l}^{(m)}$. Accordingly the vectors $\mathfrak{l}^{(1)}, \cdots, \mathfrak{l}^{(m)}$
span a non-vanishing sub-space which is invariant under all the
correspondences $U$, and in consequence of the irreducibility
assumed above it follows that $m = n$ and the vectors $\mathfrak{l}^{(1)}, \cdots, \mathfrak{l}^{(n)}$
span the entire $n$-dimensional space. The basis $L^{(1)}, \cdots, L^{(n)}$
of the family of special L-matrices can be chosen in such a way
that $\mathfrak{l}^{(1)}, \cdots, \mathfrak{l}^{(n)}$ are the fundamental vectors of the space;
$\mathfrak{l}^{(1)}$ is then the column $(1, 0, 0, \cdots, 0)$, etc. Since then

$$U\mathfrak{l}^{(i)} = u_{1i}\,\mathfrak{l}^{(1)} + \cdots + u_{ni}\,\mathfrak{l}^{(n)} \qquad (10.7)$$

we must also have

$$UL^{(i)} = u_{1i}\,L^{(1)} + \cdots + u_{ni}\,L^{(n)}. \qquad (10.8)$$

We now consider an arbitrary column, say the $k$-th, of $L$.
(This is of course of no interest if $k < h$, for the first $h - 1$
columns vanish.) Suppressing the second index $k$, we now let
$\mathfrak{l} = (l_1, \cdots, l_n)$ denote the $k$-th column of $L$. Then in accordance
with (10.8), equation (10.7) holds for the present $\mathfrak{l}$, i.e. the $k$-th
instead of the $h$-th column of $L$. Introducing for the moment the
matrix

$$\Lambda = \begin{pmatrix} l_1^{(1)} & \cdots & l_1^{(n)} \\ \vdots & & \vdots \\ l_n^{(1)} & \cdots & l_n^{(n)} \end{pmatrix},$$

consisting of the columns $\mathfrak{l}^{(1)}, \cdots, \mathfrak{l}^{(n)}$, we may write
(10.7) as the matrix equation

$$U\Lambda = \Lambda U.$$

But it follows from this that $\Lambda$ must be a multiple of the unit
matrix, i.e.

$$l_r^{(i)} = \lambda \cdot \delta_r^i, \qquad
\delta_r^i = \begin{cases} 1 & (r = i) \\ 0 & (r \neq i) \end{cases}$$

or, returning to the original notation by adding the column
index $k$,

$$l_{rk}^{(i)} = \lambda_k \cdot \delta_r^i.$$

Here we have, by the foregoing, $\lambda_1 = \cdots = \lambda_{h-1} = 0$, $\lambda_h = 1$.
The equation

$$\mathrm{tr}(UL^{(r)}) = 0$$

becomes

$$\sum_{k=1}^{n} u_{kr}\,\lambda_k = 0 \qquad (r = 1, \cdots, n), \qquad (10.9)$$

i.e. all correspondences of the system $\Sigma^{*}$ carry the vector $\lambda$
with components $(\lambda_1, \lambda_2, \cdots, \lambda_n)$ over into the null-vector.
In consequence of the irreducibility of $\Sigma$ this vector must therefore
vanish, which is in contradiction with the equation $\lambda_h = 1$;
Burnside's theorem then follows by reductio ad absurdum. If
we know that the unit matrix is contained in the system $\Sigma$, as
is the case for a representation, we can conclude that $\lambda_r = 0$ by
taking $U$ in (10.9) as the unit matrix.

Reducibility requires that on employing an appropriate
co-ordinate system all matrices $U$ of the system $\Sigma$ have an
entire rectangle of vanishing elements and consequently implies
a system of homogeneous linear relations between the components
$u_{ik}$ of a very special kind. Burnside's theorem states that if
there exists no system of homogeneous linear relations of this
special kind, then there exists no linear dependence at all. The
real reason for this remarkable fact is of course to be found in
the assumption that $\Sigma$ is closed with respect to multiplication.
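Burnside's theorem can be observed on small finite systems by counting how many linearly independent matrices they contain. The sketch below assumes two illustrative systems that do not come from the text: the 2-dimensional irreducible representation of the permutation group of three objects, and the reducible representation of the same group by 3-rowed permutation matrices. All names are hypothetical.

```python
from fractions import Fraction

def matmul(A, B):
    n = len(A)
    return tuple(tuple(sum(A[i][k] * B[k][j] for k in range(n))
                       for j in range(n)) for i in range(n))

def closure(generators):
    """Smallest multiplicative system containing the generators (assumed finite)."""
    elems = set(generators)
    frontier = list(elems)
    while frontier:
        x = frontier.pop()
        for y in list(elems):
            for prod in (matmul(x, y), matmul(y, x)):
                if prod not in elems:
                    elems.add(prod)
                    frontier.append(prod)
    return elems

def span_dimension(matrices):
    """Dimension of the linear span of the matrices, each flattened into a row."""
    rows = [[Fraction(v) for row in M for v in row] for M in matrices]
    rk = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(rk, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for r in range(len(rows)):
            if r != rk and rows[r][col] != 0:
                factor = rows[r][col] / rows[rk][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[rk])]
        rk += 1
    return rk

irreducible = closure([((0, -1), (1, -1)), ((0, 1), (1, 0))])
permutation = closure([((0, 1, 0), (1, 0, 0), (0, 0, 1)),
                       ((0, 0, 1), (1, 0, 0), (0, 1, 0))])

dim_irreducible = span_dimension(irreducible)   # compare with n*n = 4
dim_permutation = span_dimension(permutation)   # compare with n*n = 9
```

In the irreducible case the matrices span all $n^2$ dimensions, so no homogeneous linear relation holds between the components; in the reducible case the span falls short of $n^2$, which signals linear dependences.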

If our system $\Sigma$ consists of an irreducible representation
which associates with the elements $s$ of the group $\mathfrak{g}$ the matrix
$U(s)$, we see from Burnside's theorem that the components $u_{ik}(s)$ of
$U(s)$ are linearly independent. The method developed above
can readily be extended to prove the same for the components
of two or more inequivalent irreducible representations $U(s)$,
$U'(s), \cdots$. From this it follows that in particular there can
exist no linear dependences between their characters $\chi(s), \chi'(s), \cdots$.
Any unitary representation $\mathfrak{H}$ can be reduced into irreducible
components; the character of $\mathfrak{H}$ is expressed in terms of the
characters of these irreducible representations by (9.3). Since
$\chi(s), \chi'(s), \cdots$ are linearly independent the coefficients $m, m', \cdots$,
which give the number of times the irreducible representations
$\mathfrak{h}, \mathfrak{h}', \cdots$ appear in $\mathfrak{H}$, are uniquely determined. This
constitutes a new indirect proof of the following result, which has
already been proved in § 6 in a more general and more elementary
way: The irreducible representations into which $\mathfrak{H}$ can be reduced,
as well as the number of times they occur, are uniquely determined
by $\mathfrak{H}$, no distinction being made between equivalent representations.
Two unitary representations $\mathfrak{H}_1$ and $\mathfrak{H}_2$ are obviously equivalent
if every irreducible representation which is contained in the one
is contained in the other the same number of times. Hence
if $\mathfrak{H}_1$ and $\mathfrak{H}_2$ are inequivalent the character of $\mathfrak{H}_1$ cannot be the
same as the character of $\mathfrak{H}_2$ because of the linear independence
of the primitive characters: a unitary representation is uniquely
determined by its character alone, and its character may be used
as a unique name for the representation itself. We here go no
further into these extensions of Burnside's theorem, which are
due to Frobenius and I. Schur, as we shall obtain the same results
by a more profound method in the next section under assumptions
which are more restrictive but which are sufficient for
our purposes.

We mention only one consequence. If $\mathfrak{H}$, $\mathfrak{H}'$ are irreducible
representations of the groups $\mathfrak{g}$, $\mathfrak{g}'$, respectively, then $\mathfrak{H} \times \mathfrak{H}'$ is an
irreducible representation of $\mathfrak{g} \times \mathfrak{g}'$. Indeed, there can exist no
homogeneous linear relation with constant coefficients $c_{ik,\iota\kappa}$ between
the components $u_{ik}(s)\,u'_{\iota\kappa}(s')$ of $U(s) \times U'(s')$ except the trivial
one $c = 0$. For on applying Burnside's theorem for the
irreducible system $\mathfrak{H}$ we have

$$\sum_{\iota,\kappa} c_{ik,\iota\kappa}\,u'_{\iota\kappa}(s') = 0,$$

and on applying it again for $\mathfrak{H}'$ we must have $c_{ik,\iota\kappa} = 0$.

§ 11. Orthogonality Properties of Group Characters

If the abstract group $\mathfrak{g}$ is finite, then any representation
$\mathfrak{H}: s \to U(s)$ is equivalent to a unitary one. To show this take
any positive definite Hermitian form, e.g. the unit form, subject
it to all transformations $U(s)$ of $\mathfrak{H}$ and sum over $s$. We thus
obtain a positive definite Hermitian form $H$ which is invariant
under each of the transformations $U(s)$. Now choose the
co-ordinate system in such a way that $H$ becomes the unit form;
then $U(s)$, expressed in terms of these co-ordinates, is unitary.
This same method of summation over the elements of the group
gives rise to the fundamental orthogonality relations.
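The summation process can be carried out explicitly on a small example. The sketch below assumes, as an illustration not taken from the text, a real, non-unitary 2-dimensional representation of the permutation group of three objects; for real matrices the Hermitian conjugate is simply the transpose. The names `GROUP` and `transformed_form` are hypothetical.

```python
def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(A):
    return [list(col) for col in zip(*A)]

# Assumed example: six integer matrices forming a non-unitary representation.
I2 = [[1, 0], [0, 1]]
A3 = [[0, -1], [1, -1]]
B2 = [[0, 1], [1, 0]]
GROUP = [I2, A3, matmul(A3, A3), B2, matmul(B2, A3), matmul(A3, B2)]

def transformed_form(U, H):
    """Matrix of the form H after the substitution x -> U x (real case): U^T H U."""
    return matmul(transpose(U), matmul(H, U))

# Subject the unit form to every U(s) and sum over the group.
H = [[0, 0], [0, 0]]
for U in GROUP:
    T = transformed_form(U, I2)
    H = [[H[i][j] + T[i][j] for j in range(2)] for i in range(2)]

invariant = all(transformed_form(U, H) == H for U in GROUP)
positive_definite = H[0][0] > 0 and H[0][0] * H[1][1] - H[0][1] * H[1][0] > 0
```

Because left multiplication by a fixed element merely permutes the terms of the sum, the summed form is automatically invariant; choosing co-ordinates in which it becomes the unit form would then make every matrix of the representation unitary.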

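Let $\mathfrak{H}: s \to U(s)$, $\mathfrak{H}': s \to U'(s)$ be two inequivalent
irreducible representations of the finite group $\mathfrak{g}$, the former being
$g$-dimensional and the latter $g'$-dimensional. We write

$$U(s) = \|u_{ik}(s)\|,\quad U^{-1}(s) = \|\hat{u}_{ik}(s)\|;\qquad
U'(s) = \|u'_{\iota\kappa}(s)\|,\quad U'^{-1}(s) = \|\hat{u}'_{\iota\kappa}(s)\|.$$

For a unitary representation

$$\hat{u}_{ik}(s) = \bar{u}_{ki}(s).$$

If $A = \|a_{k\kappa}\|$ is an arbitrary matrix with $g$ rows and $g'$ columns then
obviously the sum

$$\sum_t U(t)\,A\,U'^{-1}(t) = B, \qquad (11.1)$$

taken over all elements $t$ of $\mathfrak{g}$, is invariant in the sense that

$$U(s)\,B\,U'^{-1}(s) = B. \qquad (11.2)$$

In fact, the left-hand side of (11.2) becomes, in virtue of the
fact that $s \to U(s)$ is a representation of $\mathfrak{g}$,

$$\sum_r U(r)\,A\,U'^{-1}(r),$$

where $r = st$, $s$ being fixed and $t$ running through all elements
of the group. We therefore obtain equation (11.2) or

$$U(s)B = BU'(s).$$

In accordance with the fundamental theorem (10.5) it follows
from this that $B = 0$, i.e.

$$\sum_t \sum_{k,\kappa} u_{ik}(t)\,a_{k\kappa}\,\hat{u}'_{\kappa\iota}(t) = 0.$$

Writing $s$ in place of $t$ and remembering that the $a_{k\kappa}$ are arbitrary
numbers, we obtain the $g^2 \cdot g'^2$ equations

$$\sum_s u_{ik}(s)\,\hat{u}'_{\kappa\iota}(s) = 0,$$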

or, in dealing with unitary representations,

$$\sum_s u_{ik}(s)\,\bar{u}'_{\iota\kappa}(s) = 0. \qquad (11.3)$$

Taking the single irreducible representation $s \to U(s)$ instead
of the two inequivalent representations $\mathfrak{H}$, $\mathfrak{H}'$ we find by
the same argument that the square matrix

$$\sum_s U(s)\,A\,U^{-1}(s) = B,$$

found from an arbitrary square matrix $A$, must satisfy the equation

$$U(s)B = BU(s).$$

This requires, however, that $B$ be a multiple of the unit matrix 1,
i.e.

$$\sum_s \sum_{k,\kappa} u_{ik}(s)\,a_{k\kappa}\,\hat{u}_{\kappa\iota}(s) = \alpha \cdot \delta_{i\iota};$$

the number $\alpha$ depends on the matrix $A$, the dependence being
of course linear and homogeneous. Taking as $A$ that matrix
which has as its only non-vanishing element $a_{k\kappa} = 1$, we obtain
the equation

$$\sum_s u_{ik}(s)\,\hat{u}_{\kappa\iota}(s) = \alpha_{k\kappa}\,\delta_{i\iota}. \qquad (11.4)$$

Now $\|\hat{u}_{ik}(s)\|$ is the matrix reciprocal to $\|u_{ik}(s)\|$:

$$\sum_i \hat{u}_{\kappa i}(s)\,u_{ik}(s) = \delta_{\kappa k}.$$

On taking $\iota = i$ in (11.4) and summing over $i = 1, 2, \cdots, g$
we find that

$$h \cdot \delta_{\kappa k} = g \cdot \alpha_{k\kappa},$$

where $h$ is the order of the group $\mathfrak{g}$.
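Taken together, the last two relations say that the sum over the group of $u_{ik}(s)$ times the $(\kappa, \iota)$ element of $U^{-1}(s)$ equals $h/g$ when $i = \iota$, $k = \kappa$ and vanishes otherwise. The sketch below checks this on an assumed example that is not from the text: a 2-dimensional irreducible (non-unitary) representation of the permutation group of three objects, so that $h = 6$ and $g = 2$. All names are hypothetical.

```python
def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def inverse(U):
    """Inverse of a 2-rowed integer matrix whose determinant is +1 or -1."""
    (a, b), (c, d) = U
    det = a * d - b * c        # equals its own reciprocal here
    return [[d * det, -b * det], [-c * det, a * det]]

# Assumed example representation (six integer matrices).
I2 = [[1, 0], [0, 1]]
A3 = [[0, -1], [1, -1]]
B2 = [[0, 1], [1, 0]]
GROUP = [I2, A3, matmul(A3, A3), B2, matmul(B2, A3), matmul(A3, B2)]

def orthogonality_sum(i, k, kappa, iota):
    """Sum over the group of U(s)[i][k] times the (kappa, iota) element of U(s)^-1."""
    return sum(U[i][k] * inverse(U)[kappa][iota] for U in GROUP)

h, g = len(GROUP), 2
table = {(i, k, kappa, iota): orthogonality_sum(i, k, kappa, iota)
         for i in range(2) for k in range(2)
         for kappa in range(2) for iota in range(2)}
```

Working with the inverse matrix rather than the conjugate keeps the check valid even though the example matrices are not unitary.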

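Expressing the sum $\sum_s$ in terms of the mean value
$\mathfrak{M}_s = \frac{1}{h}\sum_s$, our results may be written in the form

$$\mathfrak{M}\{u_{ik}(s)\,\bar{u}_{\iota\kappa}(s)\} =
\begin{cases} 1/g & \text{for } i = \iota,\ k = \kappa, \\ 0 & \text{otherwise} \end{cases} \qquad (11.5)$$

for any irreducible unitary representation $\mathfrak{H}: s \to U(s)$ and

$$\mathfrak{M}\{u_{ik}(s)\,\bar{u}'_{\iota\kappa}(s)\} = 0 \qquad (11.6)$$

for any two inequivalent irreducible unitary representations
$s \to U(s)$, $s \to U'(s)$. The components of one or more inequivalent
irreducible unitary representations constitute a unitary-orthogonal
set of functions on the group manifold.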

It follows from these fundamental orthogonality relations
that the components $u_{ik}(s), u'_{\iota\kappa}(s), \cdots$ are linearly independent.
Since the number of linearly independent functions of an argument
$s$ which assumes but $h$ values cannot be greater than $h$
we must have

$$g^2 + g'^2 + \cdots \leq h.$$

On the left-hand side of this inequality occur the squares of the
degrees of any inequivalent irreducible representations of $\mathfrak{g}$.

We obtain the orthogonality properties of the characters
by writing $k = i$, $\kappa = \iota$ in (11.5), (11.6) and summing over
these indices:

Any primitive character $\chi(s)$ satisfies the equation

$$\mathfrak{M}\{\chi(s)\,\bar\chi(s)\} = 1, \qquad (11.7)$$

and the characters $\chi(s), \chi'(s)$ of two inequivalent irreducible
representations satisfy

$$\mathfrak{M}\{\chi(s)\,\bar\chi'(s)\} = 0. \qquad (11.7')$$

The primitive characters of inequivalent representations constitute
a normal orthogonal set of functions. They are consequently
linearly independent, and from this follow all the consequences
discussed in the previous section. In particular, a representation
of $\mathfrak{g}$ can be unambiguously described by its character, no
distinction being made between equivalent representations. The
number of times $m$ the irreducible $\chi$ occurs in the representation
$X$ is, following (9.3), given by

$$m = \mathfrak{M}\{X(s)\,\bar\chi(s)\}, \qquad (11.8)$$

and we have

$$\mathfrak{M}\{X(s)\,\bar{X}(s)\} = m^2 + m'^2 + \cdots.$$

This last equation offers a simple criterion for the irreducibility
of a given representation in terms of its character $\chi$: it is
necessary and sufficient that the mean value of $\chi\bar\chi = |\chi|^2$, which is in
any case integral, be unity.
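The multiplicity formula and the irreducibility criterion lend themselves to a small computation. The sketch below assumes the well-known character table of the permutation group of three objects (three classes of sizes 1, 3 and 2), which is not given in the text, and applies it to the character of the 3-dimensional permutation representation; the names are illustrative.

```python
from fractions import Fraction

# Assumed data: classes are the identity, the transpositions, the 3-cycles.
CLASS_SIZES = [1, 3, 2]
PRIMITIVE = {
    "trivial":  [1, 1, 1],
    "sign":     [1, -1, 1],
    "standard": [2, 0, -1],
}

def mean(chi, psi):
    """Mean value over the group of chi(s) * conj(psi(s)); all values are real here."""
    h = sum(CLASS_SIZES)
    return Fraction(sum(n * a * b for n, a, b in zip(CLASS_SIZES, chi, psi)), h)

def multiplicities(X):
    """Number of times each primitive character occurs in the character X."""
    return {name: mean(X, chi) for name, chi in PRIMITIVE.items()}

# Character of the permutation representation: number of fixed objects per class.
permutation_character = [3, 1, 0]
m = multiplicities(permutation_character)
norm = mean(permutation_character, permutation_character)
```

The mean value of $|X|^2$ comes out as the sum of the squares of the multiplicities, so a value larger than 1 shows at once that the representation is reducible.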

Since the characters are class functions we are, in dealing
with them, concerned with an argument which runs through
the $K$ different classes of $\mathfrak{g}$; there can therefore be no more
than $K$ linearly independent class functions. Hence a finite
group can have no more inequivalent irreducible representations
than classes.

Whereas the general concept of a representation seemed at
first to open up limitless possibilities, we now see that all
representations are constructed from primitive ones and that
the number of possible primitive representations is confined
within narrow limits. The further content of the general theory
of representations can be stated in the theorem that the sets of
functions, the orthogonality of which we have shown above, are
complete orthogonal systems. The primitive characters constitute
a complete orthogonal system in the domain of class
functions, i.e. there exist exactly $K$ inequivalent irreducible
representations. The components of a complete system of $K$
inequivalent irreducible representations constitute a complete
orthogonal system for the totality of functions defined on the
group manifold, or

$$h = g^2 + g'^2 + \cdots,$$

where the sum on the right is extended over such a complete
system and $g, g', \cdots$ are the dimensionalities of the individual
irreducible representations.

§ 12. Extension to Closed Continuous Groups

The theory developed in the preceding sections cannot be
extended to arbitrary groups, but it is applicable mutatis
mutandis to a group whose elements constitute a continuous
closed manifold of a finite number of dimensions. Just as the
immediate neighbourhood of a point on a surface constitutes
a plane, so the immediate neighbourhood of a point $p_0$ on an
$r$-dimensional continuous manifold constitutes an $r$-dimensional
linear manifold and the line elements from $p_0$ to neighbouring
points $p$ define an $r$-dimensional linear vector space. We
assume that the infinitesimal elements of our group $\mathfrak{g}$ (i.e. those
elements in the neighbourhood of the unit element I), or rather
the infinitesimal vectors leading to them from I, constitute
such an $r$-dimensional vector space, the tangential space
to $\mathfrak{g}$ at I. The concept of an infinitesimal rotation will be
familiar to the reader from the kinematics of rigid bodies, as
well as the fact that these infinitesimal rotations in 3-dimensional
space constitute a 3-dimensional linear family; in $n$-dimensional
space they constitute an $[n(n-1)/2]$-dimensional family. The
multiplication of two infinitesimal elements of the group is then expressed
by the addition of the corresponding vectorial line elements in
the tangential space.

A parallelepiped which will serve as a volume element in 
the neighbourhood of I is defined by r linearly independent 
line elements, and its volume is given as usual by the absolute 
value of the determinant of the components of these r vectors. 
This volume element is, of course, not entirely independent of 
the choice of a co-ordinate system in the tangential space, but 
the transformation to a new co-ordinate system only multiplies 
the volumes of all such elemental volumes in the neighbourhood 
of I by a constant numerical factor. These volumes are there- 
fore determined to within the choice of a unit of measure ; more 
than this we can hardly require. 

On extending the theory developed in the preceding section
to continuous groups integration replaces summation, and it is
therefore necessary to be able to measure volumes on the entire
group manifold of $\mathfrak{g}$. With the aid of the foregoing, volume
elements in the neighbourhood of I can be measured and compared
immediately with each other, and the same is true for
the volume elements at any other point of the group manifold.
The only difficulty lies in carrying the unit of volume from the
point I to any other point $a$. Examination of the argument
of § 11 reveals that the measurement of volume must have the
following invariantive properties: the volume of an arbitrary
element must be unaltered by a left-translation of the group
manifold which transforms the general element $t$ into $t' = at$.
But this requirement just suffices to specify the process uniquely.
Consider the volume element at $a$ which arises from an elemental
volume at I by the left-translation which throws I into $a$; per
definitionem the volumes of these two elements shall be the same.
On carrying the volume element from $a$ to $b$ by means of the
translation $t' = (ba^{-1})t$ the equation $t' = b(a^{-1}t)$ shows that with
this definition of volume the volumes of the elements so obtained
at $a$ and $b$ are equal.

We further assume that our continuous group manifold is
closed, in the sense, for example, that the surface of a sphere
is a closed manifold in contrast with a Euclidean plane, which
is open. This guarantees that we shall be able to integrate
continuous functions of position on the group manifold over the
entire manifold. We now choose the unit of volume in such a
way that the volume of the entire manifold $\mathfrak{g}$ is 1; the integrals
are then mean values. We naturally require that the components
of $U(s)$ in a representation $s \to U(s)$ are continuous functions
of the element $s$ of $\mathfrak{g}$. The laws (11.5), (11.6), (11.7), (11.7')
and all consequences obtained from them in § 11 are then valid
for irreducible representations of the continuous group $\mathfrak{g}$ and their
characters.

The theory would be extraordinarily restricted if the measure
of volume, which we have introduced in such a way that it is
invariant under left-translations, were not automatically invariant
under (1) right-handed translations: $s \to s' = sa$ and (2) inversion:
$s \to s' = s^{-1}$. The first of these properties will be established
by showing that the volume of a volume element at I is unchanged
on taking it to $a$ by a left-translation and returning it to I by a
right-translation. Obviously each infinitesimal element $\delta s$ of
the group then undergoes the linear transformation $A$:

$$\delta s \to \delta' s = a \cdot \delta s \cdot a^{-1},$$

i.e. the conjugation associated with the element $a$. Such
linear transformations in the $r$-dimensional vector-space of the
infinitesimal elements of the group constitute a representation
$a \to A$ of the abstract group $\mathfrak{g}$. Since $\mathfrak{g}$ is closed, each $A$ must be
“absolute-unimodular,” i.e. the determinant of $A$ must have the
absolute value 1; and this in turn allows us to conclude that
the definition of transportation of volumes by either left- or
right-translations leads to the same result. To prove this
consider the element $a$ and its powers $a^2, a^3, \cdots$. Since the
group manifold $\mathfrak{g}$ is closed, the infinite set $a, a^2, a^3, \cdots$ on $\mathfrak{g}$
possesses a point of condensation $b$, i.e. an infinite set of
exponents $n$ can be found such that as $n$ runs through this set
$a^n$ converges to $b$. To the elements $a^n$ and $b$ correspond the
conjugations $A^n$ and $B$, respectively, and in virtue of the
continuity assumed above $\det(A^n)$ converges to $\det(B)$ as $n$ runs
through the chosen set. Now since $\det(B)$ is a finite non-vanishing
number, and since, if the absolute value of the determinant
of $A$ differed from 1, $\det(A^n)$ would tend toward 0 or $\infty$,
we may conclude the truth of the above assertion. This also
enables us to prove the truth of (2), invariance under inversion.
For inversion sends the element $\delta s$ at I into $-\delta s$, and this
transformation is absolute-unimodular. Now send one of two
inverse volume elements at I to $a$ by a left-translation and
the other to $a^{-1}$ by a right-translation; we thus obtain volume
elements at $a$ and $a^{-1}$ which go into each other by the inversion
$s \to s' = s^{-1}$. Since both left- and right-translations conserve
volumes, these two volume elements have the same volume.


Examples of the Orthogonality Properties 


We have already found the primitive characters for the
group of rotations $\mathfrak{d}_2$ of a circle into itself: $e(m\varphi)$, $m = 0, \pm 1,
\pm 2, \cdots$, where $\varphi$ is the angle of rotation. They constitute,
in fact, a unitary-orthogonal set of functions:

$$\frac{1}{2\pi}\int_0^{2\pi} e(m\varphi)\,\bar{e}(m'\varphi)\,d\varphi =
\begin{cases} 1 & (m = m') \\ 0 & (m \neq m') \end{cases}$$

If there existed further irreducible representations their
characters would necessarily be orthogonal to all of these; but this
is impossible, for the functions $e(m\varphi)$, where $m$ takes on all
integral values, already constitute a complete orthogonal
system. We have, however, already shown by a more direct
method (§ 8), which did not involve Parseval's equation, that
the system of primitive characters $e(m\varphi)$ was complete. It is
therefore natural to consider Parseval's equation as the simplest
case of the general group-theoretic completeness theorem
mentioned in § 11.
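The orthogonality of the characters of the circle group can be confirmed by replacing the mean value over the circle with an average over equally spaced angles. The sketch below assumes the abbreviation $e(x) = e^{ix}$; the sample count of 64 is an arbitrary choice that makes the average exact whenever $|m - m'|$ is smaller than 64.

```python
import cmath
import math

def e(x):
    """The abbreviation e(x) = exp(i*x)."""
    return cmath.exp(1j * x)

def mean_product(m, m_prime, samples=64):
    """Average of e(m*phi) * conj(e(m_prime*phi)) over equally spaced angles phi."""
    total = 0
    for n in range(samples):
        phi = 2 * math.pi * n / samples
        total += e(m * phi) * e(m_prime * phi).conjugate()
    return total / samples

# Largest deviation from the expected values 1 (m = m') and 0 (m != m').
max_deviation = max(abs(mean_product(m, mp) - (1 if m == mp else 0))
                    for m in range(-3, 4) for mp in range(-3, 4))
```

The same averaging over a finite cyclic subgroup is the discrete counterpart of the integral, which is why the two agree here to rounding accuracy.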
The character $\chi_f$ of the representation $\mathfrak{C}_f$ of the 2-dimensional
unitary unimodular group $\mathfrak{u} = \mathfrak{u}_2$ is given by (9.6). Writing

$$\varepsilon = e(\omega), \qquad \Delta = \varepsilon - \varepsilon^{-1} = 2i\sin\omega, \qquad
\frac{1}{4\pi}\,\Delta\bar\Delta\,d\omega = da,$$

we have

$$\int_{\omega=0}^{2\pi} \chi_f\,\bar\chi_g\,da =
\begin{cases} 1 & (f = g) \\ 0 & (f \neq g). \end{cases} \qquad (12.1)$$

This leads us to suspect that $da$ is the volume of that portion
of the group manifold occupied by those elements $a$ of the group
whose angles of rotation lie between $\omega$ and $\omega + d\omega$. [The
total volume of the group manifold is then

$$\frac{1}{4\pi}\int_0^{2\pi} \Delta\bar\Delta\,d\omega = 1.]$$

If this is correct, (12.1) are the orthogonality relations predicted
by the general theory, and the equation

$$\frac{da}{d\omega} = \frac{1}{4\pi}\,\Delta\bar\Delta$$

defines the density of the various classes of the group. In the
last chapter we shall actually carry through the determination
of volume and verify these results.
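The relations (12.1) can be checked numerically once the class density is written out. The sketch below assumes $\chi_f = \varepsilon^f + \varepsilon^{f-2} + \cdots + \varepsilon^{-f}$, which is real and equal to a sum of cosines, and the density $\Delta\bar\Delta/4\pi = \sin^2\omega/\pi$ on the interval from 0 to $2\pi$; the function names and the sample count are illustrative choices.

```python
import math

def chi(f, omega):
    """chi_f as the real sum of cos((f - 2k) * omega) for k = 0, ..., f."""
    return sum(math.cos((f - 2 * k) * omega) for k in range(f + 1))

def class_density(omega):
    """Assumed density da/d(omega) = Delta * conj(Delta) / (4*pi) = sin(omega)^2 / pi."""
    return math.sin(omega) ** 2 / math.pi

def inner(f, g, samples=256):
    """Riemann sum of chi_f * chi_g * density over one period [0, 2*pi)."""
    step = 2 * math.pi / samples
    return step * sum(chi(f, n * step) * chi(g, n * step) * class_density(n * step)
                      for n in range(samples))

total_volume = (2 * math.pi / 256) * sum(class_density(n * 2 * math.pi / 256)
                                         for n in range(256))
max_deviation = max(abs(inner(f, g) - (1 if f == g else 0))
                    for f in range(5) for g in range(5))
```

Since the integrand is a trigonometric polynomial of low degree, the equally spaced Riemann sum reproduces the integral up to rounding, so the deviation from the predicted values is negligible.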

If there were yet another irreducible representation, with
character $\chi$, $\xi = \Delta \cdot \chi$ would be an odd periodic function
of $\omega$ with period $2\pi$ which would be orthogonal to all the functions
$\xi_f = \Delta \cdot \chi_f$, i.e. the functions

$$\sin\omega,\ \sin 2\omega,\ \sin 3\omega,\ \cdots.$$

But these latter are already a complete orthogonal set for
the domain of odd periodic functions, and consequently the
$\mathfrak{C}_f$ $(f = 0, 1, 2, \cdots)$ constitute a complete system of irreducible
representations of the group $\mathfrak{u}$. A direct proof, which is
independent of Parseval's equation, is also to be found in Chap. V,
§ 16; indeed, it is there carried through for $\mathfrak{u}_n$ in an arbitrary
number $n$ of dimensions.

The Clebsch-Gordan series 

$$\chi_f \chi_g = \chi_{f+g} + \chi_{f+g-2} + \cdots + \chi_{|f-g|} \tag{12.2}$$

for the characters $\chi_f$ is readily verified. If we know on general
grounds that the character of a representation specifies it uniquely,
this equation can be used as a proof of the reducibility of $\mathfrak{C}_f \times \mathfrak{C}_g$
into irreducible components with characters as on the right.
Since the characters are much more readily handled than the
representations themselves this principle offers a very powerful
method for obtaining assertions concerning representations.
Let $f \ge g$ and multiply equation (12.2), which is to be verified,
by $\Delta$:

$$\xi_f \chi_g = \sum_v \xi_v \qquad (v = f+g,\ f+g-2,\ \ldots,\ f-g).$$

The product of

$$\xi_f = \varepsilon^{f+1} - \varepsilon^{-(f+1)} \quad\text{with}\quad \chi_g$$

is the difference of two sums; the one is

$$\varepsilon^{f+g+1} + \varepsilon^{f+g-1} + \cdots + \varepsilon^{f-g+1},$$

the exponent decreasing by 2 from term to term, and the other
is obtained from this one by replacing all exponents by their
negative. Hence the product is in fact

$$\sum_v \left(\varepsilon^{v+1} - \varepsilon^{-(v+1)}\right) \qquad (v = f+g,\ f+g-2,\ \ldots,\ f-g).$$
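A numerical spot check of the Clebsch-Gordan series (12.2) may make the bookkeeping of exponents concrete. The sketch below is an illustration only, not part of the argument; it assumes $\varepsilon = e^{i\omega}$ and $\chi_f = \varepsilon^{f} + \varepsilon^{f-2} + \cdots + \varepsilon^{-f}$, and the function names are hypothetical.

```python
import cmath

def chi(f, omega):
    """Assumed character: sum of eps**k for k = f, f-2, ..., -f, with eps = exp(i*omega)."""
    eps = cmath.exp(1j * omega)
    return sum(eps ** k for k in range(f, -f - 1, -2))

def clebsch_gordan_gap(f, g, omega):
    """Absolute difference between the two sides of (12.2) at one angle."""
    lhs = chi(f, omega) * chi(g, omega)
    rhs = sum(chi(v, omega) for v in range(f + g, abs(f - g) - 1, -2))
    return abs(lhs - rhs)

# Largest discrepancy over a small grid of f, g and sample angles.
worst_gap = max(
    clebsch_gordan_gap(f, g, 0.1 + 0.37 * k)
    for f in range(5) for g in range(5) for k in range(8)
)
```

Under these assumptions `worst_gap` should differ from zero only by rounding error, which is what the term-by-term cancellation in the text predicts.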

The representations $\mathfrak{C}_f^{+}$, $\mathfrak{C}_f^{-}$ ($f = 0, 1, 2, \ldots$) constitute a
complete set of inequivalent irreducible representations of the
augmented group $\mathfrak{u}_2'$. To establish this we first note that in an
irreducible representation of $\mathfrak{u}'$ the matrix associated with the
element $\iota$ must be a multiple of the unit matrix, for it commutes
with the irreducible system of matrices constituting the representation.
Furthermore, $\iota\iota = I$, so this matrix can only be
$+1$ or $-1$. Since the matrix associated with $\iota$ is a multiple
of the unit matrix, and since the extension of $\mathfrak{u}$ to $\mathfrak{u}'$ involves
the addition of a single element $\iota$, the representation must remain
irreducible on restricting the group $\mathfrak{u}'$ to the sub-group $\mathfrak{u}$. Hence
every irreducible representation of $\mathfrak{u}_2'$ is obtained by supplementing
the irreducible representations of $\mathfrak{u}_2$ by the association

$$\iota \to +1 \quad\text{or}\quad \iota \to -1.$$

If $\mathfrak{H}$, $\mathfrak{H}'$ run independently through complete systems of
inequivalent irreducible representations of the two (finite or
closed continuous) groups $\mathfrak{g}$, $\mathfrak{g}'$, respectively, then the $\mathfrak{H} \times \mathfrak{H}'$
constitute a complete system of inequivalent irreducible representations
for the direct product $\mathfrak{g} \times \mathfrak{g}'$. To prove this we
note that since the primitive characters $\chi(s)$ of $\mathfrak{g}$ constitute a
complete orthogonal system for class functions of the element $s$
which runs through $\mathfrak{g}$ and the primitive characters $\chi'(s')$ of $\mathfrak{g}'$
do the same for $\mathfrak{g}'$, the totality of the products $\chi(s) \cdot \chi'(s')$
constitute a complete orthogonal system for the class functions of
the element $(s, s')$ which runs through the group $\mathfrak{g} \times \mathfrak{g}'$.

The representations $\mathfrak{C}_{f,g}$ introduced in § 6 constitute a complete
system of irreducible representations of $\mathfrak{c}_2$ when $f$, $g$ run
independently through the numbers $0, 1, 2, \ldots$; we here
only mention this fact without going further into it.


§ 13. The Algebra of a Group 

We return for the present to finite groups. In order to be
able to express the completeness theorem we associate with
each function $x(s)$ on the group manifold of the finite group $\mathfrak{g}$
its “Fourier coefficient matrix,” the group matrix

$$X = \sum_s x(s)\,U(s), \tag{13.1}$$

where $\mathfrak{H}: s \to U(s)$ is a representation of $\mathfrak{g}$. The trace of $X$,

$$\xi = \sum_s x(s)\,\chi(s), \tag{13.2}$$

is the Fourier coefficient of $x(s)$ with respect to the character
$\chi(s)$ of $\mathfrak{H}$. It is here desirable to consider the function $x(s)$ as
a single quantity $x$ in the group domain; each element $s$ of the
group is a dimension in “group space” and the number $x(s)$
is the $s$-component of the quantity $x$. We may express the
quantities themselves symbolically in the form

$$x = \sum_s x(s) \cdot s. \tag{13.3}$$

The matrix $X$ is associated with the quantity $x$ in the representation
$\mathfrak{H}$: $x \to X$ in $\mathfrak{H}$. Addition of “group quantities” and
multiplication of them by a number are introduced in the usual
way: $x + y$ has the components $x(s) + y(s)$ and $ax$ the components
$a \cdot x(s)$. Group quantities consequently behave like
vectors in an $h$-dimensional space, where $h$ is the order of the
group. The following definition of multiplication of two arbitrary
group quantities $x$ and $y$ is suggested by (13.3):

$$z = xy = \sum_{t,t'} x(t)\,y(t') \cdot tt' = \sum_s z(s) \cdot s,$$

where

$$z(s) = \sum_{tt' = s} x(t)\,y(t'). \tag{13.4}$$

This last equation, in which the sum is to be extended over all
pairs of elements $t$, $t'$ whose product is $s$, defines the product $z$
of the quantities $x$ and $y$. We denote this product by $xy$ and its
components by $xy(s)$; this is not to be confused with $x(s) \cdot y(s)$,
the ordinary product of the two numbers $x(s)$, $y(s)$. Addition
and multiplication of group quantities parallel addition and
multiplication of the group matrices associated with them by
(13.1). Indeed, the product of

$$X = \sum_s x(s)\,U(s), \qquad Y = \sum_s y(s)\,U(s)$$

is given by

$$Z = XY = \sum_{t,t'} x(t)\,y(t')\,U(tt') = \sum_s z(s)\,U(s),$$

where $z(s)$ is defined by (13.4).
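The parallel between multiplication of group quantities and multiplication of group matrices can be tried out on a small case. The following sketch is only an illustration under stated assumptions: the group is taken to be the symmetric group on three letters, the representation is the 3-dimensional one by permutation matrices, and all function names are hypothetical.

```python
from itertools import permutations

GROUP = list(permutations(range(3)))  # the six elements of the symmetric group on 3 letters

def compose(s, t):
    """Group product st, read as 'apply t first, then s'."""
    return tuple(s[t[i]] for i in range(3))

def perm_matrix(s):
    """U(s): entry (row, col) is 1 when s sends col to row, so that U(s)U(t) = U(st)."""
    return [[1 if s[col] == row else 0 for col in range(3)] for row in range(3)]

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def group_matrix(x):
    """X = sum over s of x(s) U(s), as in (13.1); x maps group elements to numbers."""
    total = [[0] * 3 for _ in range(3)]
    for s, coeff in x.items():
        u = perm_matrix(s)
        for i in range(3):
            for j in range(3):
                total[i][j] += coeff * u[i][j]
    return total

def algebra_product(x, y):
    """z(s) = sum of x(t) y(t') over all pairs with t t' = s, as in (13.4)."""
    z = {s: 0 for s in GROUP}
    for t in GROUP:
        for t_prime in GROUP:
            z[compose(t, t_prime)] += x[t] * y[t_prime]
    return z

def basis_element(u):
    """Group quantity whose only non-vanishing component is 1 at u."""
    return {s: 1 if s == u else 0 for s in GROUP}

x = {s: i + 1 for i, s in enumerate(GROUP)}        # arbitrary integer components
y = {s: 2 * i - 5 for i, s in enumerate(GROUP)}
product_matches = group_matrix(algebra_product(x, y)) == mat_mul(group_matrix(x), group_matrix(y))

a, b = basis_element((1, 0, 2)), basis_element((0, 2, 1))
commutes = algebra_product(a, b) == algebra_product(b, a)
```

Integer components keep the comparison exact. The two transpositions chosen for `a` and `b` do not commute, so the last line also exhibits the failure of the commutative law.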

The operations to which the group quantities may be subjected:
(1) addition, (2) multiplication with a number, and (3)
multiplication with one another, satisfy the usual laws of
ordinary algebra with two important exceptions: multiplication
is not commutative and division is not in general possible, i.e. the
equation $ax = b$ for given $a \ne 0$ and $b$ may have no unique
solution or even no solution at all. But there does exist a
quantity $1$ having the properties of unity: $1a = a1 = a$ for
every quantity $a$; its components all vanish with the exception
of the one associated with $s = I$, which is 1. A domain of
quantities as described above is called an algebra, and the
“group quantities” are the elements of the algebra; care must
be taken not to confuse these with the elements of the group
(cf. V, § 5). The association $x \to X$ in the representation $\mathfrak{H}$
satisfies the conditions:

1. $1 \to 1$, to the element $1$ corresponds the unit matrix $1$;

2. if $x \to X$, $y \to Y$ and $a$ is a number, then

$$x + y \to X + Y, \qquad ax \to aX, \qquad xy \to XY.$$

A representation $\mathfrak{H}$ of the group is the same as a realization or
“representation” of the algebra of the group by matrices such
that these conditions are satisfied. Actually all we have done
here is this: we have gone over from the matrices $U(s)$ associated
with the individual elements of the group to the linear
manifold of matrices for which they constitute a basis.

What characterizes an element $a$ of the algebra whose components
$a(s)$ define a class function? We have in general

$$ax(s) = \sum_t a(st)\,x(t^{-1}), \qquad xa(s) = \sum_t a(ts)\,x(t^{-1}),$$

and a class function satisfies the equation

$$a(st) = a(ts).$$

Hence such an $a$ is characterized by the fact that it commutes
with all elements $x$ of the algebra: $ax = xa$. Employing a
term carried over from group theory to algebra we may say:
those elements whose components depend only on the class of
conjugate group elements to which the argument $s$ belongs constitute
the central of the algebra.
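That class functions, and only these, commute with every element of the algebra can be checked directly in a small example. The sketch below assumes the symmetric group on three letters, where the class of an element is fixed by its number of fixed points; the names and the sample values are hypothetical.

```python
from itertools import permutations

GROUP = list(permutations(range(3)))

def compose(s, t):
    """Group product st, read as 'apply t first, then s'."""
    return tuple(s[t[i]] for i in range(3))

def algebra_product(x, y):
    """Product of two group quantities according to (13.4)."""
    z = {s: 0 for s in GROUP}
    for t in GROUP:
        for t_prime in GROUP:
            z[compose(t, t_prime)] += x[t] * y[t_prime]
    return z

def basis_element(u):
    return {s: 1 if s == u else 0 for s in GROUP}

def fixed_points(s):
    return sum(1 for i in range(3) if s[i] == i)

def is_central(a):
    """Commuting with every basis element is the same as commuting with the whole algebra."""
    return all(
        algebra_product(a, basis_element(u)) == algebra_product(basis_element(u), a)
        for u in GROUP
    )

# A class function: one arbitrary value per class (identity, transpositions, 3-cycles).
class_values = {3: 5, 1: -2, 0: 7}
a = {s: class_values[fixed_points(s)] for s in GROUP}

# Not a class function: it singles out one transposition among the three.
b = basis_element((1, 0, 2))

a_is_central = is_central(a)
b_is_central = is_central(b)
```

Here `a_is_central` comes out true and `b_is_central` false, in agreement with the characterization of the central given above.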

We are interested only in unitary representations $s \to U(s)$.
For such a representation the Hermitian conjugate of (13.1) is

$$\tilde{X} = \sum_s \bar{x}(s)\,\tilde{U}(s) = \sum_s \bar{x}(s)\,U(s^{-1}) = \sum_s \bar{x}(s^{-1})\,U(s).$$

Hence on defining the conjugate $\tilde{x}$ of the element $x$ by
$\tilde{x}(s) = \bar{x}(s^{-1})$,
Hermitian conjugate matrices are associated with conjugate
elements in a unitary representation; this characterizes unitary
representations. An element will be said to be real if it coincides
with its conjugate. We have seen that the character
$\chi(s)$ of a unitary representation satisfies this condition.

Let $\mathfrak{H}$ be a $g$-dimensional irreducible unitary representation
of $\mathfrak{g}$. $C = \|c_{ik}\|$ being a given $g$-dimensional matrix, the element
$c$ of the algebra defined by

$$c(s) = \frac{g}{h} \sum_{i,k} c_{ik}\,\bar{u}_{ik}(s) = \frac{g}{h}\,\operatorname{tr}[C\,\tilde{U}(s)]$$

is such that $c \to C$ in $\mathfrak{H}$; this is readily verified with the aid of
the orthogonality relations. Hence in the correspondence $x \to X$,
$X$ runs through all $g$-dimensional matrices. We denote the
quantity with components $\frac{g}{h}\,\bar{u}_{ik}(s)$ by $e_{ik}$. The set $H$ of all
elements of the form

$$c = \sum_{i,k} c_{ik}\,e_{ik},$$

where the coefficients $c_{ik}$ are arbitrary, is naturally closed with
respect to the operations of addition and multiplication by a
number. But the product of two elements in $H$ is again an
element in $H$; indeed, if $c$ is in $H$ and $x$ is an arbitrary element
of the algebra both $cx$ and $xc$ are also in $H$. We express this
situation in a terminology paralleling that of the theory of groups:
$H$ is an invariant sub-algebra of the algebra $P$ of all group quantities.
To prove these assertions we first note that the definition (13.1),
together with the condition that $s \to U(s)$ be a representation,
yields the equation

$$X\,U(s^{-1}) = \sum_t x(t)\,U(ts^{-1}),$$

or, on replacing $U(s^{-1})$ by $\tilde{U}(s)$,

$$X\,\tilde{U}(s) = \sum_t \tilde{U}(st^{-1})\,x(t). \tag{13.5}$$

Multiplying on the left by $C = \|c_{ik}\|$ and constructing the trace
we find

$$\frac{g}{h}\,\operatorname{tr}[(CX)\,\tilde{U}(s)] = \sum_t c(st^{-1})\,x(t) = cx(s),$$

whence $y = cx$ is in $H$:

$$cx = \sum_{i,k} y_{ik}\,e_{ik} \tag{13.6}$$

and the matrix

$$\|y_{ik}\| = CX. \tag{13.7}$$


In the same way we can show that if $c$ belongs to $H$ then $xc$
does also. If

$$x \to \|x_{ik}\| \quad\text{in } \mathfrak{H}$$

we call

$$\sum_{i,k} x_{ik}\,e_{ik}$$

the component of $x$ in $H$. In accordance with (13.6), (13.7) this
component is the product of $x$ with

$$\epsilon = e_{11} + e_{22} + \cdots + e_{gg};$$

it is $\epsilon x = x\epsilon$. $\epsilon$ is a real element belonging to the central of the
group algebra with components $\frac{g}{h} \cdot \bar{\chi}(s)$; it is “idempotent,”
i.e. it satisfies the equation $\epsilon\epsilon = \epsilon$. In particular, the product
of two elements

$$a = \sum a_{ik}\,e_{ik}, \qquad b = \sum b_{ik}\,e_{ik}$$

of $H$ with coefficient matrices $A$, $B$, is the quantity $ab$ in $H$
with the coefficient matrix $AB$. $\epsilon$ is the 1, the “modulus,” or
“principal unit,” of the sub-algebra $H$ since $\epsilon x = x\epsilon = x$ when
$x$ is in $H$. The algebra $H$ is identical with the algebra of all
$g$-dimensional matrices (“simple matric algebra”). The “units”
$e_{ik}$ satisfy the equations

$$e_{ir}\,e_{rk} = e_{ik}, \qquad e_{ir}\,e_{sk} = 0 \ \text{ for } r \ne s. \tag{13.8}$$

The central of the sub-algebra $H$ consists only of the multiples
of its modulus $\epsilon$.

An irreducible representation $\mathfrak{H}'$: $s \to U'(s) = \|u'_{\iota\kappa}(s)\|$ of
dimensionality $g'$ which is not equivalent to $\mathfrak{H}$ yields another
invariant sub-algebra $H'$ consisting of all elements of the form

$$c' = \sum_{\iota,\kappa} c'_{\iota\kappa}\,e'_{\iota\kappa}.$$

The components of $e'_{\iota\kappa}$ are $\frac{g'}{h}\,\bar{u}'_{\iota\kappa}(s)$. It follows from the
orthogonality relations existing between inequivalent representations
that $c' \to 0$ in the representation $\mathfrak{H}$. If $c$ is in $H$,
then, by applying (13.6) for $x = c'$, $cc' = y$ is also, but since
then $X = 0$, (13.7) yields $y = 0$; the two sub-algebras are
independent in the sense that the product of an element in one
with an element in the other is always 0. Hence the “units”
satisfy

$$e_{ik}\,e'_{\iota\kappa} = 0, \qquad e'_{\iota\kappa}\,e_{ik} = 0. \tag{13.9}$$

The modulus

$$\epsilon' = \sum_{\iota} e'_{\iota\iota}$$

of $H'$ satisfies $\epsilon\epsilon' = \epsilon'\epsilon = 0$ in addition to $\epsilon'\epsilon' = \epsilon'$.

If $a(s)$ is a class function, $a$ belongs to the central of $P$ and
if $a \to A$ in the $g$-dimensional irreducible representation $\mathfrak{H}$
then the matrix $A$ commutes with all matrices $X$. Hence $A$
is a multiple of the unit matrix: $A = \frac{\alpha}{g} \cdot 1$. By (13.2) we find
that the trace $\alpha$ of $A$ is *

$$\alpha = \sum_s a(s)\,\chi(s).$$

In this way the entire theory of representations can be
translated into the language of modern algebra. This leads to
a greater freedom of operation and is preferable for the expression
of the completeness theorem. The orthogonality relations
between the $u_{ik}(s)$ yield Bessel’s inequality

$$g \cdot \operatorname{tr}(X\tilde{X}) + \cdots \le h \cdot \sum_s x(s)\,\bar{x}(s), \tag{13.10}$$

where $X$ in the sum on the left is the matrix (13.1) associated
with $x(s)$ in the $g$-dimensional irreducible representation $\mathfrak{H}$ and
the sum is taken over any set of inequivalent irreducible representations
$\mathfrak{H}, \ldots$. This inequality is obtained by expressing
the fact that the mean value of $z(s)\,\bar{z}(s)$ is non-negative (cf. I, § 7),
where $z$ is that element obtained from $x$ on subtracting from $x$
its components in $H, \ldots$:

$$z = x - \left(\sum x_{ik}\,e_{ik} + \cdots\right) = x - (x\epsilon + \cdots).$$

Since the characters constitute an orthogonal system we also
have the Bessel inequality

$$\xi\bar{\xi} + \cdots \le h \cdot \sum_s x(s)\,\bar{x}(s), \tag{13.11}$$

where $\xi$ is defined by (13.2). The completeness theorem asserts
that in both cases the equality sign holds when the sum is extended
over a complete system of inequivalent irreducible representations,
where in (13.10) $x(s)$ is any function on the group manifold and
in (13.11) any class function. The second relation is a special
case of the first, since for class functions $X = \frac{\xi}{g} \cdot 1$.

* Cf. also Appendix 2 at the end of the book.
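The content of (13.11) can be seen in the smallest non-commutative case. The sketch below is an illustration under assumptions not taken from the text: the group is the symmetric group on three letters, its three primitive characters are written down directly (identical, alternating, and "number of fixed points minus one"), and the class function `x` has arbitrarily chosen values. All characters are real here, so complex conjugation is omitted.

```python
from itertools import permutations

GROUP = list(permutations(range(3)))
h = len(GROUP)  # order of the group

def fixed_points(s):
    return sum(1 for i in range(3) if s[i] == i)

def sign(s):
    inversions = sum(1 for i in range(3) for j in range(i + 1, 3) if s[i] > s[j])
    return -1 if inversions % 2 else 1

CHARACTERS = [
    lambda s: 1,                    # identical representation
    sign,                           # alternating representation
    lambda s: fixed_points(s) - 1,  # 2-dimensional representation
]

# An arbitrary class function: one value per class.
class_values = {3: 4, 1: -1, 0: 2}
x = {s: class_values[fixed_points(s)] for s in GROUP}

# Fourier coefficients (13.2), one for each character.
xi = [sum(x[s] * chi(s) for s in GROUP) for chi in CHARACTERS]

full_sum = sum(value * value for value in xi)          # left side, complete system
partial_sum = xi[0] ** 2 + xi[2] ** 2                   # left side, alternating character omitted
right_side = h * sum(x[s] ** 2 for s in GROUP)
```

With the complete system the two sides agree; leaving out one character gives a strictly smaller left side, which is the Bessel inequality proper.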

If the abstract group $\mathfrak{g}$ is a finite continuous group which
is closed in the sense of § 12, instead of a finite group as above,
the sums must be replaced by integrals; the measure of volume
on the group manifold is introduced as in § 12. We then have
in place of (13.1), (13.4):

$$X = \int x(s)\,U(s)\,ds,$$

$$xy(s) = \int x(st^{-1})\,y(t)\,dt = \int x(t)\,y(t^{-1}s)\,dt.$$

The modulus $1$ of the algebra must have as components the
values of a function $1(s)$ which vanishes everywhere on the
group manifold except at the point $s = I$ and must there be
so large that $\int 1(s)\,ds = 1$. Such a function does not exist, but
we can construct functions approximating these conditions
arbitrarily closely.

The completeness relations assert that any element $x$ of
the algebra of a finite group $\mathfrak{g}$ is the sum of its components in
the totality of sub-algebras associated with a complete system
of inequivalent irreducible representations. The group algebra
$P$ is thus reduced to a set of independent simple matric algebras.
It suffices to prove this theorem for $x = 1$:

$$1 = \epsilon + \epsilon' + \cdots = (e_{11} + \cdots + e_{gg}) + \cdots, \tag{13.12}$$

for on multiplying this by $x$ it follows for all elements $x$. These
assertions cannot be carried over to continuous groups in the
form here stated; we must hold to the formulation (13.10)
(with $=$ instead of $\le$) containing an arbitrary function $x(s)$.
We go into the proof of these results in Chap. V, where all
the results of this section will be derived anew and discussed in
detail from another more profound point of view.
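The decomposition (13.12) of the modulus can be verified exactly in a small case. The sketch below assumes the symmetric group on three letters with its three primitive characters and takes the components of each modulus to be $\frac{g}{h}\chi(s)$ (the characters are real, so the conjugation plays no role); the names are hypothetical and exact fractions are used to avoid rounding.

```python
from fractions import Fraction
from itertools import permutations

GROUP = list(permutations(range(3)))
IDENTITY = (0, 1, 2)
h = len(GROUP)

def compose(s, t):
    return tuple(s[t[i]] for i in range(3))

def algebra_product(x, y):
    """Product of two group quantities according to (13.4)."""
    z = {s: Fraction(0) for s in GROUP}
    for t in GROUP:
        for t_prime in GROUP:
            z[compose(t, t_prime)] += x[t] * y[t_prime]
    return z

def fixed_points(s):
    return sum(1 for i in range(3) if s[i] == i)

def sign(s):
    inversions = sum(1 for i in range(3) for j in range(i + 1, 3) if s[i] > s[j])
    return -1 if inversions % 2 else 1

# (dimension g, character) for the three inequivalent irreducible representations.
IRREDUCIBLE = [(1, lambda s: 1), (1, sign), (2, lambda s: fixed_points(s) - 1)]

# Modulus of each sub-algebra: components (g/h) * chi(s).
moduli = [{s: Fraction(g, h) * chi(s) for s in GROUP} for g, chi in IRREDUCIBLE]

unit = {s: Fraction(1 if s == IDENTITY else 0) for s in GROUP}
zero = {s: Fraction(0) for s in GROUP}

resolves_unit = {s: sum(e[s] for e in moduli) for s in GROUP} == unit
idempotent = all(algebra_product(e, e) == e for e in moduli)
independent = all(
    algebra_product(moduli[i], moduli[j]) == zero
    for i in range(3) for j in range(3) if i != j
)
```

All three flags come out true in this example: the moduli are idempotent, annihilate one another, and add up to the unit of the algebra.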

§ 14. Invariants and Covariants 

We first discuss briefly the classical concept of an invariant.
Consider, for example, the group $\mathfrak{c} = \mathfrak{c}_2$ of homogeneous linear
transformations of two variables $\xi$, $\eta$ with unit determinant.
Let

$$a\xi^2 + 2b\xi\eta + c\eta^2$$

be an arbitrary quadratic form in the two variables. The
“discriminant” $ac - b^2$ is an invariant, for the discriminants
of two forms which are such that either goes into the other on
transforming $\xi$, $\eta$ by some element of $\mathfrak{c}$ have the same value.
We may have, instead of one arbitrary quadratic form, one or
more arbitrary forms $f, \varphi, \ldots$ of given orders $n, \nu, \ldots$. An
invariant is a rational integral function $I$ of the coefficients of
these forms which is homogeneous in the coefficients of each of
the forms $f, \varphi, \ldots$ and which has the same value on replacing
these coefficients by the coefficients of the forms $f', \varphi', \ldots$ into
which $f, \varphi, \ldots$ are transformed by an arbitrary transformation
$\sigma$ of $\mathfrak{c}$ affecting the variables $\xi$, $\eta$.
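The invariance of the discriminant is easy to confirm by direct substitution. In the sketch below the substitution $\xi = p\xi' + q\eta'$, $\eta = r\xi' + s\eta'$ with $ps - qr = 1$ and the function names are assumptions made for the illustration; integer coefficients keep the arithmetic exact.

```python
def transform_form(a, b, c, p, q, r, s):
    """Coefficients of a*xi^2 + 2b*xi*eta + c*eta^2 after xi = p*xi' + q*eta', eta = r*xi' + s*eta'."""
    a_new = a * p * p + 2 * b * p * r + c * r * r
    b_new = a * p * q + b * (p * s + q * r) + c * r * s
    c_new = a * q * q + 2 * b * q * s + c * s * s
    return a_new, b_new, c_new

def discriminant(a, b, c):
    return a * c - b * b

form = (2, 3, -1)
substitution = (2, 3, 1, 2)          # p, q, r, s with p*s - q*r = 1
transformed = transform_form(*form, *substitution)

before = discriminant(*form)
after = discriminant(*transformed)
```

For a substitution of determinant $D$ the same computation gives the discriminant multiplied by $D^2$, which is why the restriction to unit determinant is what makes it an invariant in the strict sense.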

The coefficients $a_0, a_1, \ldots, a_n$ of an arbitrary form of order
$n$ in the variables $\xi$, $\eta$ undergo a certain linear transformation
on subjecting the variables to a transformation $\sigma$ of $\mathfrak{c}$, and the
correspondence between $\sigma$ and this transformation constitutes
a representation of the group $\mathfrak{c}$. The same is true for the totality
of monomials

$$a_0^{r_0}\,a_1^{r_1} \cdots a_n^{r_n} \qquad (r_0 + r_1 + \cdots + r_n = r)$$

of order $r$ in these coefficients. A homogeneous polynomial
$I$ of order $r$ in the $a_i$ is a linear combination of these monomials.
We thus see that if $I$ is of given degrees $r, \rho, \ldots$ in the coefficients
of the arbitrary forms $f, \varphi, \ldots$ it is a linear combination of
quantities which constitute the substratum of a definite representation
of $\mathfrak{c}$; this representation is known as soon as we
have given the orders $n, \nu, \ldots$ of the forms $f, \varphi, \ldots$ in the
variables $\xi$, $\eta$ and the degrees $r, \rho, \ldots$ of the invariant $I$ in the
arbitrary coefficients of $f, \varphi, \ldots$. Discarding the all too special
formal algebraic assumptions involved in the “classical”
concept of an invariant, and which the theory of invariants has
from the beginning attempted to outgrow by generalizations in
various directions, we may express the concept in modern
group-theoretic language as follows:

Let $\mathfrak{H}: s \to U(s)$ be a given representation of an abstract group
$\mathfrak{g}$ in an $n$-dimensional representation space $\mathfrak{R}$ with variables $x_i$;
a linear form in the $x_i$ is said to be an invariant in the representation
space $\mathfrak{R}$ of $\mathfrak{H}$ if it is unchanged under all the transformations $U(s)$.
If $I_1, I_2, \ldots$ are invariants in the representation space of $\mathfrak{H}$
then any linear combination $a_1 I_1 + a_2 I_2 + \cdots$ of them with
constant coefficients $a_1, a_2, \ldots$ is also an invariant. The most
important problem arising here is naturally that concerning the
number $m$ of linearly independent invariants in the given
representation space. If $y_1, \ldots, y_m$ constitute such a complete
set of linearly independent invariants, and if we choose as
co-ordinates in $\mathfrak{R}$ these $m$ quantities and $n - m$ further linear
forms $y_{m+1}, \ldots, y_n$ such that the two sets together constitute
a complete system of linearly independent linear forms in $\mathfrak{R}$,
the transformation $U(s)$ is, in terms of the variables $y$,

$$\begin{aligned}
y'_1 &= y_1, \quad \ldots, \quad y'_m = y_m; \\
y'_{m+1} &= u_{m+1,1}(s)\,y_1 + \cdots + u_{m+1,n}(s)\,y_n, \\
&\ \ \vdots \\
y'_n &= u_{n1}(s)\,y_1 + \cdots + u_{nn}(s)\,y_n.
\end{aligned}$$

If we are dealing with a unitary representation the $y$'s can be so
chosen that they define a normal co-ordinate system; $\mathfrak{H}$ is
then reduced into $m$ times the 1-dimensional identical representation
$y' = y$ and an $(n - m)$-dimensional representation.
Hence the problem of finding the number of linearly independent
invariants in the representation space $\mathfrak{R}$ reduces to finding how
often the identical representation with the character 1 is contained
in the given $\mathfrak{H}$. But by formula (11.8) the solution of
this problem is given by

$$m = \mathfrak{M}\{\chi(s)\}, \tag{14.1}$$

or: the mean value of the character $\chi$ of $\mathfrak{H}$, which is always a
non-negative integer, gives the number of linearly independent
invariants in the representation space of $\mathfrak{H}$.
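Formula (14.1) reduces the counting of invariants to an average over the group. As a hedged illustration, the sketch below takes the symmetric group on three letters acting by permutation of three variables $x_1, x_2, x_3$, and also its action on the nine products $x_i y_k$; the example and the names are assumptions, not taken from the text.

```python
from fractions import Fraction
from itertools import permutations

GROUP = list(permutations(range(3)))

def mean_value(function):
    """Mean of a function over the group, the operation written M{...} in (14.1)."""
    return Fraction(sum(function(s) for s in GROUP), len(GROUP))

def chi_permutation(s):
    """Character of the 3-dimensional representation permuting x_1, x_2, x_3: number of fixed points."""
    return sum(1 for i in range(3) if s[i] == i)

# Number of independent invariant linear forms in x_1, x_2, x_3.
m_linear = mean_value(chi_permutation)

# Number of independent invariant forms linear in the nine products x_i * y_k;
# the character of this product representation is the square of chi_permutation.
m_bilinear = mean_value(lambda s: chi_permutation(s) ** 2)
```

The values found, 1 and 2, match what one can write down by hand: the single invariant $x_1 + x_2 + x_3$ in the first case, and the two invariants $\sum_i x_i y_i$ and $\sum_{i \ne k} x_i y_k$ in the second.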

The formula (14.1) answers the principal question arising
in the linear invariant theory, and we now proceed to an extremely
brief discussion of the algebraic invariant theory. Let
$\mathfrak{G}, \mathfrak{H}, \ldots$ be representations of the same abstract group $\mathfrak{g}$ in
the spaces with variables $x_i, y_k, \ldots$. We consider rational
integral functions $I(x_i, y_k, \ldots)$ which are homogeneous in the
variables $x_i$, homogeneous in the variables $y_k$, etc. If on subjecting
$x, y, \ldots$ to those linear transformations corresponding
to the same arbitrary group element $s$ in the representations
$\mathfrak{G}, \mathfrak{H}, \ldots$, $I$ remains unchanged, then it is said to be a rational
integral invariant of the system $[\mathfrak{G}, \mathfrak{H}, \ldots]$ of representations.
If the orders $p, q, \ldots$ of the function $I$ in the variables $x_i, y_k, \ldots$
are given, the problem reduces to the one discussed above;
for the monomials in these variables which are homogeneous
of order $p$ in the $x_i$, homogeneous of order $q$ in the $y_k$, $\ldots$ constitute
the substratum of a representation obtained in a certain
way from $\mathfrak{G}, \mathfrak{H}, \ldots$. But if we consider simultaneously invariants
of all possible orders belonging to the system $[\mathfrak{G}, \mathfrak{H}, \ldots]$
we are confronted with new problems. The most important of
these, which is answered in the affirmative by the so-called
fundamental theorem of the theory of invariants, is: Do there
exist a finite number of invariants such that all others can be
expressed rationally and integrally in terms of them? This
involves the question of algebraic, rather than linear, dependence
between the invariants. We only mention this higher branch
of the theory of invariants, and do not go into it further, as it
bears no direct relation to quantum mechanics.

In addition to invariants or scalars, covariant linear
quantities, such as vectors and tensors, play an important
role in physics. Let $\mathfrak{g}$ be the group of all linear transformations
between the normal co-ordinate systems in space or in space-time,
e.g. the 3-dimensional group of Euclidean rotations or
the group of Lorentz transformations, and let $\mathfrak{H}: s \to U(s)$ be
an $n$-dimensional representation of $\mathfrak{g}$. A covariant quantity of
kind $\mathfrak{H}$ is an entity having $n$ components $a_1, \ldots, a_n$ relative
to any given co-ordinate system for the variables of the transformation
group $\mathfrak{g}$ and which is such that on going over to a new co-ordinate
system by means of the transformation $s$ of $\mathfrak{g}$ the new
components $a_i$ are obtained from the old by the corresponding
transformation $U(s)$ of $\mathfrak{H}$. If $\mathfrak{H}$ is irreducible such a quantity
is said to be primitive or simple. Physical quantities are generally
simple. Thus, for example, the entity whose components are
the electro-magnetic field strengths in the 4-dimensional world
is described as an “anti-symmetric tensor of order 2” rather
than merely as a “tensor of order 2”; we shall see in Chap. V,
§ 4, that it is therefore a simple quantity. The reduction of
a representation into its irreducible constituents implies the
reduction of the corresponding kind of quantities into simple
quantities. It would appear that the only simple quantities
with which we deal are tensors which are characterized by
certain symmetry conditions in addition to their order. We
shall prove this theorem for the complete linear group $\mathfrak{c}$ and for
its unitary sub-group $\mathfrak{u}$ in Chap. V; it asserts that all representations
of $\mathfrak{c}$ (or $\mathfrak{u}$) can be obtained by reduction from the
powers $\mathfrak{c}, (\mathfrak{c})^2, (\mathfrak{c})^3, \ldots$ and that the irreducible constituents
of $(\mathfrak{c})^f$ are obtained by imposing certain symmetry conditions.

We must accordingly generalize the problem of the linear
theory of invariants in the following manner. Consider two
unitary representations $\mathfrak{h}: \sigma \to s$, $\mathfrak{H}: \sigma \to S$ of the abstract
group $\mathfrak{g}$ with elements $\sigma$; let their dimensionalities be $n$, $N$
and let $\mathfrak{h}$ be irreducible. We wish to determine all covariant
quantities of kind $\mathfrak{h}$ in the representation space of $\mathfrak{H}$. Calling the
variables in this representation space $x_i$, which undergo the
transformation $S$ under the influence of $\sigma$, such a quantity
$l$ has $n$ components $l_1, \ldots, l_n$ which are linearly independent
linear forms in the variables $x_i$. When the $x_i$ undergo the transformation
$S$ the $n$ linear forms $l_\alpha$ go over into new ones which
are obtained from the $l_\alpha$ (in which the variables $x_i$ have been
transformed in accordance with $S$) by means of the transformation
$s$ of $\mathfrak{h}$. If there exist two or more covariant quantities

$$l = (l_1, l_2, \ldots, l_n), \qquad l' = (l'_1, l'_2, \ldots, l'_n), \qquad \ldots$$

of the kind $\mathfrak{h}$ in the representation space of $\mathfrak{H}$, then any linear
combination $al + a'l' + \cdots$ with constant coefficients $a$ is
again a quantity of the same kind. We ask for the number $m$
of linearly independent quantities of this kind. The answer is
that $m$ is equal to the number of times the irreducible representation
$\mathfrak{h}$ is contained in $\mathfrak{H}$. Hence if $\chi$, $X$ are the characters of $\mathfrak{h}$, $\mathfrak{H}$,
we have

$$m = \mathfrak{M}\{X(\sigma)\,\bar{\chi}(\sigma)\}. \tag{14.2}$$
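Formula (14.2) can be tried on a case small enough to check by counting dimensions. The sketch below is an illustration only: the group is assumed to be the symmetric group on three letters, the large representation is the 9-dimensional product of the permutation representation with itself, and the three primitive characters are written down directly. The characters are real, so the conjugation in (14.2) is omitted.

```python
from fractions import Fraction
from itertools import permutations

GROUP = list(permutations(range(3)))

def mean_value(function):
    """Mean of a function over the group."""
    return Fraction(sum(function(s) for s in GROUP), len(GROUP))

def fixed_points(s):
    return sum(1 for i in range(3) if s[i] == i)

def sign(s):
    inversions = sum(1 for i in range(3) for j in range(i + 1, 3) if s[i] > s[j])
    return -1 if inversions % 2 else 1

def big_character(s):
    """Character of the 9-dimensional product representation."""
    return fixed_points(s) ** 2

# name -> (dimension, character) of the irreducible representations.
IRREDUCIBLE = {
    "identical": (1, lambda s: 1),
    "alternating": (1, sign),
    "two-dimensional": (2, lambda s: fixed_points(s) - 1),
}

multiplicity = {
    name: mean_value(lambda s, chi=chi: big_character(s) * chi(s))
    for name, (dim, chi) in IRREDUCIBLE.items()
}

# The multiplicities, weighted by dimension, must exhaust the 9 dimensions.
dimension_check = sum(multiplicity[name] * dim for name, (dim, chi) in IRREDUCIBLE.items())
```

In this example there are thus three linearly independent covariant quantities of the 2-dimensional kind among the linear forms in the nine product variables.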


In order to prove this statement we choose the co-ordinate
system $x_i$ in the representation space of $\mathfrak{H}$ in such a way that
the matrices of $\mathfrak{H}$ are reduced into their irreducible constituent
sub-matrices, the $m$ representations $\mathfrak{h}' = \mathfrak{h}, \ldots, \mathfrak{h}^{(m)} = \mathfrak{h}$
being separated out first. The remaining constituents
$\mathfrak{h}^{*}, \ldots$ are inequivalent to $\mathfrak{h}$. Denote the variables in the corresponding
invariant sub-spaces by

$$x'_1, \ldots, x'_n; \quad \ldots; \quad x^{(m)}_1, \ldots, x^{(m)}_n; \quad x^{*}_1, \ldots.$$

The matrix $S$ is completely reduced into the sub-matrices
$s' = s, \ldots, s^{(m)} = s$; $s^{*}, \ldots$ arranged along the principal
diagonal. Let

$$\begin{aligned}
y_1 &= a_{11}x_1 + \cdots + a_{1N}x_N, \\
&\ \ \vdots \\
y_n &= a_{n1}x_1 + \cdots + a_{nN}x_N
\end{aligned}$$

be a covariant quantity of the kind $\mathfrak{h}$. We can write this in the
form $y = Ax$ in terms of the column $x$ of the $N$ variables $x_i$,
the column $y$ of the $n$ variables $y_\alpha$ and the matrix $A = \|a_{\alpha i}\|$.
The requirement that $y$ be a quantity of kind $\mathfrak{h}$ means that
when $x$ is replaced by $x' = Sx$, $y$ goes over into $y' = sy$, or

$$sy = ASx, \qquad sAx = ASx, \qquad sA = AS. \tag{14.3}$$

Corresponding to the reduction of $x$-space into irreducible
sub-spaces, the matrix $A$ of the correspondence of $x$-space on
$y$-space is reduced into matrices $A', \ldots, A^{(m)}$; $A^{*}, \ldots$
consisting of the first $n$ columns, $\ldots$, the $m$th set of $n$ columns, $\ldots$
of $A$. Equation (14.3) then becomes

$$sA' = A's, \quad \ldots, \quad sA^{(m)} = A^{(m)}s; \qquad sA^{*} = A^{*}s^{*}, \quad \ldots.$$

It follows from the fundamental theorem (10.5) on representations
that $A', \ldots, A^{(m)}$ are all multiples of the $n$-dimensional
unit matrix and that the remaining $A^{*}, \ldots$ are all zero.
But this is just our assertion that $y = (y_1, y_2, \ldots, y_n)$ is a
linear combination of the $m$ quantities

$$\begin{aligned}
x' &= (x'_1, \ldots, x'_n), \\
&\ \ \vdots \\
x^{(m)} &= (x^{(m)}_1, \ldots, x^{(m)}_n)
\end{aligned}$$

of the kind $\mathfrak{h}$.

§ 15. Remarks on Lie’s Theory of Continuous Groups
of Transformations

In § 12 we made use of the concept of infinitesimal elements
of a group in order to establish a method of measuring volume
on a continuous group manifold. We here discuss this concept
in detail for the 3-dimensional group $\mathfrak{d}$ of rotations in Euclidean
space. This group serves to describe the mobility of a body
in Euclidean space, one point $O$ of which is fixed in space. Each
possible position of the body can be considered as arising from
any given initial position by an operation of $\mathfrak{d}$. A material
substance distributed throughout the space or any portion of
it moves as a rigid body about $O$ if the position of each of its
elements at a given moment is associated with its initial position
by means of a correspondence belonging to $\mathfrak{d}$. This is the
description of the motion of such a rigid body which compares
the position at any moment directly with the initial position,
ignoring the intermediate states which it has assumed in going
from the one into the other. But it seems more natural to
consider it in terms of a continuous motion in which the position
of the body undergoes an infinitesimal rotation from moment
to moment, so that the motion as a whole is the integration
of a series of infinitesimal operations of $\mathfrak{d}$. On employing an
auxiliary variable $t$ in order to avoid the use of infinitesimals
and thinking of this parameter as time, the velocity field
$dx = \dot{x}$, $dy = \dot{y}$, $dz = \dot{z}$ of an infinitesimal rotation is defined
by [cf. I, § 6]

$$dx = bz - cy, \qquad dy = cx - az, \qquad dz = ay - bx, \tag{15.1}$$

where the constants $a$, $b$, $c$ are independent of position $(x, y, z)$.
These velocity fields, which obviously constitute a 3-dimensional
linear manifold, are the infinitesimal elements of $\mathfrak{d}$; they are
the “vectors” which define the linear space tangent to the group
manifold at the point which represents the unit element $I$.
The continuous motion of a rigid body about $O$ is characterized
by the fact that at each moment its velocity field belongs to
the 3-parameter linear family (15.1). We may take as a basis
of this family the three elements $D_x$, $D_y$, $D_z$ obtained by choosing

$$a = 1,\ b = 0,\ c = 0; \qquad a = 0,\ b = 1,\ c = 0; \qquad a = 0,\ b = 0,\ c = 1.$$

We call these “the infinitesimal rotations about the $x$-, $y$- and
$z$-axes.” S. Lie was the first to undertake a systematic study
of the construction of transformation groups from their infinitesimal
elements. In fact, once they are known all the
substitutions of the continuous group can be generated by
integration, i.e. by successive application of such infinitesimal
elements; at least, all those which belong to the same connected
“sheet” as the identity. (Example: the proper orthogonal
transformations can be obtained from the infinitesimal ones,
but not the improper transformations with determinant $-1$.)

In general, consider a continuous $r$-parameter transformation
group $\mathfrak{G}$ and let the group manifold be described in terms of
the parameters $s_1, s_2, \ldots, s_r$ in the neighbourhood of the unit
point, at which they vanish. A portion of the group manifold
is thereby mapped in a one-to-one continuous manner on a
neighbourhood of the origin in the $r$-dimensional number space
of the parameters $s$. Let the $n$-dimensional point-field of the
transformations be described in terms of co-ordinates $x_1, x_2, \ldots, x_n$
in the neighbourhood of the point under consideration, and let
the correspondence $x \to x'$:

$$x'_i = \varphi_i(x_1, x_2, \ldots, x_n;\ s_1, \ldots, s_r)$$

be associated with the element $(s_1, \ldots, s_r)$ of the abstract
group in its realization by the transformation group. The
infinitesimal transformation $x \to x + dx$ obtained by assigning
the infinitesimal increments $ds$ to the parameters $s$ in the neighbourhood
of $s = 0$ is given by

$$dx_i = \sum_{\alpha=1}^{r} \left(\frac{\partial \varphi_i}{\partial s_\alpha}\right)_0 ds_\alpha; \tag{15.2}$$

the parentheses indicate that the differential quotients are to
be computed for $s_1 = 0, \ldots, s_r = 0$. We postulate a material
substance which fills the point-field and which is capable of
executing those and only those motions in which the positions
of its elements at an arbitrary moment $t'$ are obtained from their
positions at time $t$ by a transformation of $\mathfrak{G}$. Again its motion
can be more simply described as the result of successive deformations
corresponding to infinitesimal operations (15.2) of our
group; the velocity field must at any time have the form

$$dx_i = \sum_{\alpha=1}^{r} a_\alpha \left(\frac{\partial \varphi_i}{\partial s_\alpha}\right)_0, \tag{15.3}$$

where $a_1, \ldots, a_r$ are constants independent of position. This
$r$-dimensional linear family constitutes the infinitesimal group of
motions of our substance. It is to be observed that the application
of these infinitesimal processes to our transformation group
presupposes that the functions $\varphi_i$ are differentiable with respect
to $s$ at the point $s = 0$. In the theory of abstract groups the
point-field is the group manifold itself and we take as a realization
(left-)translation. In the neighbourhood of the unit element
$s = 0$, $t = 0$ we have, as law of composition,

$$(st)_\alpha = \varphi_\alpha(s_1, \ldots, s_r;\ t_1, \ldots, t_r) \qquad [\alpha = 1, \ldots, r].$$

The introduction of a measure of volume in § 12 presupposes
that the functions $\varphi_\alpha$ are, for sufficiently small $t$, differentiable
with respect to the $s$ at the point $s = 0$, and that for sufficiently
small $s$ they are differentiable with respect to $t$ at $t = 0$.

The composition of infinitesimal elements of the group is
expressed by addition of the parameters $a$ introduced by (15.3).
It might therefore appear as if the infinitesimal elements of an
$r$-parameter continuous group need satisfy no condition other
than that they constitute a linear family. However, that is
not the case; there are further “integrability conditions” to
be satisfied. The example of a sphere which rolls without
slipping on a horizontal table shows that the possible positions
of a body whose infinitesimal motions have but three degrees
of freedom can nevertheless constitute a 5-dimensional manifold.
The integrability conditions we are seeking, which involve
second order derivatives, guarantee that this situation does not
arise. We obtain these conditions on expressing the fact that
the commutator $sts^{-1}t^{-1}$ of two infinitesimal elements $s$, $t$ of the
group also is an element of the group. This commutator converges
to $I$ as $s$ approaches the unit element $I$, whatever $t$ may
be, and similarly as $t \to I$ for arbitrary $s$. The commutator of
the two infinitesimal linear correspondences $A$ and $B$:

$$dx = Ax, \qquad d'x = Bx$$

is the infinitesimal correspondence $AB - BA$; to show this
we note that the equation

$$A(s)\,B(t) = \Gamma(s, t)\,B(t)\,A(s)$$

leads, on writing

$$A(0) = 1, \quad B(0) = 1, \quad \left(\frac{dA}{ds}\right)_{s=0} = A, \quad \left(\frac{dB}{dt}\right)_{t=0} = B, \quad \lim_{s,t \to 0} \frac{\Gamma(s, t) - 1}{s \cdot t} = C,$$

to the equation

$$C = AB - BA.$$
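The limit defining $C$ can be approximated numerically. The sketch below is a rough check under assumptions of its own: the one-parameter families are taken to be $A(s) = \exp(sA)$ and $B(t) = \exp(tB)$, summed as truncated power series, and $A$, $B$ are chosen as the generators of rotations about two perpendicular axes. The function names are hypothetical.

```python
def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def exp_series(generator, parameter, terms=25):
    """exp(parameter * generator), summed as a truncated power series."""
    n = len(generator)
    step = [[parameter * entry for entry in row] for row in generator]
    result, term = identity(n), identity(n)
    for k in range(1, terms):
        term = [[entry / k for entry in row] for row in mat_mul(term, step)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result

A = [[0, 0, 0], [0, 0, -1], [0, 1, 0]]
B = [[0, 0, 1], [0, 0, 0], [-1, 0, 0]]

def commutator_quotient(s, t):
    """(Gamma(s, t) - 1) / (s * t) with Gamma = A(s) B(t) A(s)^-1 B(t)^-1."""
    gamma = mat_mul(mat_mul(exp_series(A, s), exp_series(B, t)),
                    mat_mul(exp_series(A, -s), exp_series(B, -t)))
    unit = identity(3)
    return [[(gamma[i][j] - unit[i][j]) / (s * t) for j in range(3)] for i in range(3)]

ab, ba = mat_mul(A, B), mat_mul(B, A)
expected = [[ab[i][j] - ba[i][j] for j in range(3)] for i in range(3)]

quotient = commutator_quotient(1e-3, 1e-3)
error = max(abs(quotient[i][j] - expected[i][j]) for i in range(3) for j in range(3))
```

The residual `error` is of the order of the parameters themselves, since the terms neglected in the quotient are of first order in $s$ and $t$; it shrinks as both are made smaller.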


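Our main purpose in mentioning these matters is to prepare
the ground for an understanding from general principles of the
commutation rules satisfied by the three infinitesimal rotations

$$D_x = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{pmatrix}, \quad D_y = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ -1 & 0 & 0 \end{pmatrix}, \quad D_z = \begin{pmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}. \tag{15.4}$$

They are, as is readily shown,

$$D_x D_y - D_y D_x = D_z, \qquad D_y D_z - D_z D_y = D_x, \qquad D_z D_x - D_x D_z = D_y. \tag{15.5}$$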
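The commutation rules for the three infinitesimal rotations can be confirmed by multiplying out the matrices obtained from (15.1) with the three basis choices of $a$, $b$, $c$. The sketch below does this in exact integer arithmetic; the helper names are hypothetical.

```python
D_X = [[0, 0, 0], [0, 0, -1], [0, 1, 0]]   # a = 1, b = 0, c = 0 in (15.1)
D_Y = [[0, 0, 1], [0, 0, 0], [-1, 0, 0]]   # a = 0, b = 1, c = 0
D_Z = [[0, -1, 0], [1, 0, 0], [0, 0, 0]]   # a = 0, b = 0, c = 1

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def commutator(a, b):
    ab, ba = mat_mul(a, b), mat_mul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(3)] for i in range(3)]

def velocity(a, b, c, point):
    """Velocity (dx, dy, dz) of (15.1): the matrix a*D_x + b*D_y + c*D_z applied to the point."""
    generator = [[a * D_X[i][j] + b * D_Y[i][j] + c * D_Z[i][j] for j in range(3)]
                 for i in range(3)]
    return tuple(sum(generator[i][j] * point[j] for j in range(3)) for i in range(3))

relations_hold = (
    commutator(D_X, D_Y) == D_Z
    and commutator(D_Y, D_Z) == D_X
    and commutator(D_Z, D_X) == D_Y
)
sample_velocity = velocity(1, 2, 3, (4, 5, 6))
```

The function `velocity` reproduces the field $(bz - cy,\ cx - az,\ ay - bx)$, so that the three matrices are indeed the basis elements of the linear family (15.1).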


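We could, of course, take the unimodular unitary group $\mathfrak{u}_2$
in two dimensions as fundamental, instead of the group $\mathfrak{d}_3$ of
rotations. We denote the two variables which undergo the
transformations $\sigma$ of the unitary group by $\xi$, $\eta$ as in § 8. In
consequence of the correspondence $\sigma \to s$, which was established
there by means of a stereographic projection, the 3-dimensional
rotation group now appears as a representation of $\mathfrak{u}_2$. We can
take as a basis for the 3-parameter linear manifold of infinitesimal
operators of $\mathfrak{u}_2$ the three particular operators

$$\begin{aligned}
\tfrac{1}{2i} S_x &: \quad d\xi = \tfrac{1}{2i}\,\eta, & d\eta &= \tfrac{1}{2i}\,\xi, \\
\tfrac{1}{2i} S_y &: \quad d\xi = -\tfrac{1}{2}\,\eta, & d\eta &= \tfrac{1}{2}\,\xi, \\
\tfrac{1}{2i} S_z &: \quad d\xi = \tfrac{1}{2i}\,\xi, & d\eta &= -\tfrac{1}{2i}\,\eta;
\end{aligned} \tag{15.6}$$

here, in agreement with (8.15),

$$S_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad S_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad S_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.$$

They are the infinitesimal transformations of $\mathfrak{u}_2$ corresponding
to the three infinitesimal transformations $D_x$, $D_y$, $D_z$ of $\mathfrak{d}_3$ in
virtue of the correspondence $\sigma \to s$; that this is in fact the case
is readily seen from (8.10) or

$$x = \tfrac{1}{2}(-\xi^2 + \eta^2), \qquad y = \tfrac{1}{2i}(\xi^2 + \eta^2), \qquad z = \xi\eta.$$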

Given any representation $\mathfrak{H}: \sigma \to U(\sigma)$ of
$\mathfrak{u}_2$, its infinitesimal operators with matrices

\[ \frac{1}{i}(M_x, M_y, M_z) \]

corresponding to the infinitesimal operators (15.6) in
$\mathfrak{u}_2$ satisfy the same equations (15.5) as the
$D_x, D_y, D_z$:

\[ M_x M_y - M_y M_x = iM_z, \ \cdots. \tag{15.7} \]

The matrices $M_x, M_y, M_z$ are of course Hermitian. For reasons
which will appear in the following chapter we call these the
components of moment of momentum (or angular momentum)
$\mathfrak{M}$ of the representation $\mathfrak{H}$, and

\[ M^2 = M_x^2 + M_y^2 + M_z^2 \]

the square of the magnitude of the moment of momentum. If
$\mathfrak{H}, \mathfrak{H}'$ are two representations with angular
momenta $\mathfrak{M}, \mathfrak{M}'$ then, in accordance with the
general formula II, (10.4), which governs the composition of
infinitesimal operators by ×-multiplication, the representation
$\mathfrak{H} \times \mathfrak{H}'$ has as moment of momentum

\[ (\mathfrak{M} \times 1) + (1 \times \mathfrak{M}'). \]

We next calculate the moment of momentum of the irreducible
representation $\mathfrak{D}_j$ ($j = f/2$) of $\mathfrak{u}_2$.
It will be found more convenient to employ in place of
$S_x$, $S_y$ the transformations

\[
\begin{aligned}
\tfrac{1}{2}(S_x + iS_y) &: \quad d\xi = \eta, \quad d\eta = 0, \\
\tfrac{1}{2}(S_x - iS_y) &: \quad d\xi = 0, \quad d\eta = \xi.
\end{aligned}
\tag{15.8}
\]

In general

\[ d(\xi^r \eta^s) = r\,\xi^{r-1}\eta^s\,d\xi + s\,\xi^r\eta^{s-1}\,d\eta, \]

and on substituting in this the variables

\[ x(m) = \frac{\xi^r \eta^s}{\sqrt{r!\,s!}} \qquad (r + s = 2j,\ r - s = 2m) \]

of the representation space of $\mathfrak{D}_j$, we find that the
three infinitesimal transformations of $\mathfrak{u}_2$ defined by
(15.6), (15.8) induce in this space the transformations

\[
\begin{aligned}
\tfrac{1}{2}(S_x + iS_y) &: \quad dx(m) = \sqrt{r(s+1)}\; x(m-1) = \sqrt{(j+m)(j-m+1)}\; x(m-1), \\
\tfrac{1}{2}(S_x - iS_y) &: \quad dx(m) = \sqrt{s(r+1)}\; x(m+1) = \sqrt{(j-m)(j+m+1)}\; x(m+1), \\
\tfrac{1}{2} S_z &: \quad dx(m) = \frac{r-s}{2}\, x(m) = m\, x(m).
\end{aligned}
\]

Hence

\[
\begin{aligned}
(M_x + iM_y)(m, m-1) &= \sqrt{(j+m)(j-m+1)}, \\
(M_x - iM_y)(m, m+1) &= \sqrt{(j-m)(j+m+1)}, \\
M_z(m, m) &= m.
\end{aligned}
\tag{15.9}
\]

All other components $(m, m')$ vanish. $M^2$ is a multiple of the
unit matrix in $\mathfrak{D}_j$:

\[ M^2 = j(j+1), \]

for it follows from

\[
(M_x + iM_y)(M_x - iM_y) = M_x^2 + M_y^2 - i(M_x M_y - M_y M_x)
= M_x^2 + M_y^2 + M_z
\]

that

\[ M^2 = (M_x + iM_y)(M_x - iM_y) - M_z + M_z^2, \]

and from this and (15.9) that

\[ M^2(m, m) = (j+m)(j-m+1) - m + m^2 = j(j+1). \]
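
As a numerical illustration of (15.9), the matrices of the moment of momentum can be built for a given j and tested against the commutation rule M_x M_y − M_y M_x = iM_z and against M² = j(j + 1). This is only a sketch under stated assumptions: rows and columns are labelled by m = j, j − 1, ..., −j, the argument `two_j` stands for 2j so that half-integral j can be passed as an integer, and all names are illustrative.

```python
import math

def momentum_matrices(two_j):
    """Return (Mx, My, Mz) for j = two_j / 2, indexed by m = j, j-1, ..., -j."""
    j = two_j / 2
    n = two_j + 1
    ms = [j - a for a in range(n)]          # index a corresponds to m = j - a
    plus = [[0j] * n for _ in range(n)]     # Mx + i My, elements (m, m - 1)
    minus = [[0j] * n for _ in range(n)]    # Mx - i My, elements (m, m + 1)
    mz = [[0j] * n for _ in range(n)]
    for a, m in enumerate(ms):
        mz[a][a] = complex(m)
        if a + 1 < n:
            plus[a][a + 1] = complex(math.sqrt((j + m) * (j - m + 1)))
        if a > 0:
            minus[a][a - 1] = complex(math.sqrt((j - m) * (j + m + 1)))
    mx = [[(plus[a][b] + minus[a][b]) / 2 for b in range(n)] for a in range(n)]
    my = [[(plus[a][b] - minus[a][b]) / 2j for b in range(n)] for a in range(n)]
    return mx, my, mz

def matmul(p, q):
    n = len(p)
    return [[sum(p[i][k] * q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def max_deviation(p, q):
    return max(abs(x - y) for rp, rq in zip(p, q) for x, y in zip(rp, rq))

def check(two_j, tol=1e-9):
    """True if [Mx, My] = i Mz and Mx^2 + My^2 + Mz^2 = j(j+1) within tol."""
    mx, my, mz = momentum_matrices(two_j)
    n, j = two_j + 1, two_j / 2
    xy, yx = matmul(mx, my), matmul(my, mx)
    comm = [[xy[a][b] - yx[a][b] for b in range(n)] for a in range(n)]
    i_mz = [[1j * mz[a][b] for b in range(n)] for a in range(n)]
    squares = [matmul(m, m) for m in (mx, my, mz)]
    m2 = [[sum(s[a][b] for s in squares) for b in range(n)] for a in range(n)]
    target = [[j * (j + 1) if a == b else 0 for b in range(n)] for a in range(n)]
    return max_deviation(comm, i_mz) < tol and max_deviation(m2, target) < tol
```

Calling `check(1)` tests the case j = 1/2 and `check(2)` the case j = 1; floating-point square roots are used, hence the tolerance.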

If on reducing an arbitrary representation $\mathfrak{H}$ the
irreducible representation $\mathfrak{D}_j$ is found to occur
exactly $g_j$ times, then $M^2$ has $j(j+1)$ as a
$[(2j+1)g_j]$-fold characteristic number and $M_z$ has the
characteristic number $m$ with multiplicity

\[ \sum_j g_j \qquad (j = |m|,\ |m| + 1,\ \cdots). \]

From this we again see that the multiplicity $g_j$ with which
$\mathfrak{D}_j$ occurs in the reduction of $\mathfrak{H}$ is
uniquely determined by $\mathfrak{H}$. These infinitesimal
operations can be used to give a relatively elementary
constructive proof of the fact that the $\mathfrak{D}_j$ are the
only irreducible representations of $\mathfrak{u}_2$.^*
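
The remark on multiplicities can be made concrete. In the sketch below (illustrative names throughout, with the integers 2j and 2m used as keys so that half-integral values need no fractions) the multiplicities of the characteristic numbers m of M_z are computed from the g_j, and the g_j are then recovered as the difference of the multiplicities belonging to m = j and m = j + 1.

```python
def mz_multiplicities(g):
    """g maps 2j -> g_j; return a dict 2m -> multiplicity of m for Mz."""
    mult = {}
    for two_j, g_j in g.items():
        # Each copy of the representation with this j contributes
        # m = j, j-1, ..., -j exactly once.
        for two_m in range(-two_j, two_j + 1, 2):
            mult[two_m] = mult.get(two_m, 0) + g_j
    return mult

def recover_g(mult):
    """Invert mz_multiplicities: g_j = mult(m = j) - mult(m = j + 1)."""
    g = {}
    for two_m, count in mult.items():
        if two_m >= 0:
            g_j = count - mult.get(two_m + 2, 0)
            if g_j:
                g[two_m] = g_j
    return g
```

For instance, the input `{0: 2, 1: 1, 2: 3}` (twice j = 0, once j = 1/2, three times j = 1) is returned unchanged by `recover_g(mz_multiplicities(...))`.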


§ 16. Representation by Rotations of Ray Space

In quantum theory the representations take place in system
space; but this is to be considered as a ray rather than a vector
space, for a pure state is represented by a ray rather than a
vector. Two unitary transformations $U$ and $\varepsilon U$ which
differ only by a numerical factor $\varepsilon$ of absolute
magnitude 1 are consequently to be considered as the same, for
they determine the same rotation of the ray field. In a "ray
representation" which associates with each element $s$ of the
abstract group $\mathfrak{g}$ a unitary rotation $U(s)$ of the
rays of n-dimensional representation space, the gauge factor
$\varepsilon(s)$ may be taken arbitrarily for each unitary matrix
$U(s)$; if $\mathfrak{g}$ is a continuous group we choose it,
however, in such a way that $U(s)$ depends continuously on $s$.
The condition for a representation is now only

\[ U(s)U(t) \simeq U(st), \tag{16.1} \]

i.e.

\[ U(s)U(t) = \delta(s, t)\,U(st), \tag{16.2} \]

where $\delta(s, t)$ is a numerical factor, of modulus 1, depending
on $s$ and $t$. If by change of gauge $U(s)$ is replaced by
$\varepsilon(s)U(s)$, $\delta(s, t)$ is replaced by

\[ \varepsilon(st)\,\varepsilon^{-1}(s)\,\varepsilon^{-1}(t)\,\delta(s, t). \]

In the equation

\[ X = \sum_s x(s)\,U(s), \]

defining the connection between the components $x(s)$ of an
element $x$ of the algebra of the group and the group matrix $X$
which represents it, the $x(s)$ are also dependent on the gauge
and are sent into $\varepsilon(s)x(s)$ on the change of gauge
defined by $U(s) \to \varepsilon(s)U(s)$. In order that the
multiplication law for two elements $x, y$ shall, as we require,
parallel the multiplication of the matrices which represent them
we must define

\[ xy(s) = \sum_{tt' = s} \delta(t, t')\,x(t)\,y(t') \tag{16.3} \]

in terms of the chosen gauge. The condition

\[ x(s^{-1}) = \bar{x}(s) \]

for a real element $x$ is only appropriate if the gauge is so
chosen that $U(s^{-1})$ is the matrix reciprocal to $U(s)$. The
algebra of the group is to be adapted in this way to the ray
representation under consideration, whereas in dealing with
"vector representations" it is uniquely determined by the law of
composition of the group alone.

Examples. 

I. The 1-dimensional representations are now entirely
uninteresting, for any 1-dimensional matrix $\simeq 1$. But under
certain circumstances Abelian groups may possess
multi-dimensional unitary ray representations, whereas any
irreducible unitary vector representation of an Abelian group is
necessarily of degree 1.

We first investigate the simplest example, a finite cyclical
group $(a)$ of order $h$, consisting of the elements

\[ I,\ a,\ \cdots,\ a^{h-1} \qquad (a^h = I). \]

Let the element $a$ correspond to the unitary matrix $A$ in the
ray representation; then $A^h = \alpha 1$ is necessarily a
multiple of the unit matrix. Since $\alpha$ is of modulus 1 we may
change the gauge in such a way that $A$ goes into
$A / \sqrt[h]{\alpha}$; then $A^h = 1$ and the correspondence
$a^k \to A^k$ is a vector representation of the cyclical group.
Hence by introducing an appropriate change of gauge the ray
representation can be made into a vector representation,
$\delta(s, t)$ being then 1.
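
The change of gauge for the cyclical group may be illustrated as follows. The sketch assumes, purely for the sake of an example, that A is a cyclic permutation matrix of order h multiplied by a phase factor, so that the h-th power of A is a multiple α of the unit matrix; the function names and the sample angle are hypothetical.

```python
import cmath

def cyclic_shift(h):
    """h x h permutation matrix P with P^h equal to the unit matrix."""
    return [[1 if (i + 1) % h == j else 0 for j in range(h)] for i in range(h)]

def matmul(p, q):
    n = len(p)
    return [[sum(p[i][k] * q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def matpow(a, k):
    n = len(a)
    result = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    for _ in range(k):
        result = matmul(result, a)
    return result

def regauge(A, h):
    """Given A with A^h = alpha * 1 and |alpha| = 1, return (alpha, A / root),
    where root is one of the h-th roots of alpha."""
    alpha = matpow(A, h)[0][0]
    root = cmath.exp(1j * cmath.phase(alpha) / h)
    return alpha, [[x / root for x in row] for row in A]

def demo(h=4, theta=0.3):
    """Build A = e^{i theta} P, regauge it, and return (alpha, (A')^h)."""
    A = [[cmath.exp(1j * theta) * x for x in row] for row in cyclic_shift(h)]
    alpha, A_new = regauge(A, h)
    return alpha, matpow(A_new, h)
```

After the change of gauge the h-th power of the new matrix is the unit matrix, up to rounding errors.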

II. The simplest example of an Abelian group which gives
rise to multi-dimensional irreducible ray representations must
consequently be non-cyclic. Consider the group consisting of
the four elements I, a, b, c with the multiplication table

\[
a^2 = b^2 = c^2 = I; \qquad bc = cb = a, \quad ca = ac = b, \quad ab = ba = c.
\tag{16.4}
\]

A ray representation $\mathfrak{B}$ is given by

\[
U(I) = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad
U(a) = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad
U(b) = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad
U(c) = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.
\tag{16.5}
\]

The normalization is here chosen in such a way that

\[ U^2(a) = U(a)\,U(a^{-1}) = 1 \]

and similarly for I, b, c. The algebra defined by (16.3) for this
representation is non-commutative in spite of the Abelian
nature of the group; it is the algebra of complex quaternions.
On denoting the elements of this algebra by

\[ x = \kappa 1 + \lambda a + \mu b + \nu c, \]

the "units" 1, a, b, c have the same multiplication table as
the corresponding matrices U:

         |   1      a      b      c
      ---+---------------------------
       1 |   1      a      b      c
       a |   a      1      ic    -ib
       b |   b     -ic     1      ia
       c |   c      ib    -ia     1

(The product xy occupies the intersection of the row x with the
column y.)

The "real" quantities are those for which all components
$\kappa, \lambda, \mu, \nu$ are real. Since in the calculus of
quaternions 1, ia, ib, ic are taken as the fundamental units,
they are those whose scalar component $\kappa$ is real and whose
vectorial components $\lambda/i, \mu/i, \nu/i$ are purely
imaginary.
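
The factor system of this ray representation can be read off by machine. The sketch below assumes that the matrices U(a), U(b), U(c) are the three matrices S_x, S_y, S_z of § 15 (an assumption about the representation, stated here explicitly) and computes, for every pair of elements, the factor δ(s, t) in U(s)U(t) = δ(s, t)U(st). The names are illustrative.

```python
# Assumed matrices of the ray representation of the four-group.
U = {
    "1": ((1, 0), (0, 1)),
    "a": ((0, 1), (1, 0)),
    "b": ((0, -1j), (1j, 0)),
    "c": ((1, 0), (0, -1)),
}

def group_product(s, t):
    """Multiplication in the Abelian group of the four elements 1, a, b, c."""
    if s == "1":
        return t
    if t == "1":
        return s
    if s == t:
        return "1"
    return next(e for e in "abc" if e not in (s, t))

def matmul2(p, q):
    return tuple(tuple(sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

def factor(s, t):
    """Return delta(s, t) defined by U(s) U(t) = delta(s, t) U(st)."""
    prod, target = matmul2(U[s], U[t]), U[group_product(s, t)]
    i, j = next((i, j) for i in range(2) for j in range(2) if target[i][j] != 0)
    delta = prod[i][j] / target[i][j]
    # The product must be a multiple of U(st) in every entry.
    assert all(prod[r][c] == delta * target[r][c]
               for r in range(2) for c in range(2))
    return delta
```

Although the group is Abelian, `factor("a", "b")` and `factor("b", "a")` differ in sign, which is the source of the non-commutativity of the algebra.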

III. The group $\mathfrak{u} = \mathfrak{u}_2$ of unitary
transformations $\sigma$ in two dimensions with determinant 1.
Consider a representation $\sigma \to U(\sigma)$ by rotations in
n-dimensional ray space. On changing the gauge in such a way that
$U(\sigma)$ goes into

\[ U(\sigma) : \sqrt[n]{\det U(\sigma)}, \tag{16.6} \]

the determinant of the new $U(\sigma)$ is 1. The only possible
difficulty consists in the fact that the root

\[ \varepsilon(\sigma) = \sqrt[n]{\det U(\sigma)} \tag{16.7} \]

is multiple-valued. It is "locally" single-valued, i.e. if we
have chosen a definite one $\varepsilon_0$ of the n values for the
point $\sigma = \sigma_0$, we can uniquely determine the root
$\varepsilon(\sigma)$ in a sufficiently small neighbourhood of
$\sigma_0$ in such a way that it depends continuously on $\sigma$
and goes over into $\varepsilon_0$ for $\sigma = \sigma_0$. Hence
we can continue the determination of the root for
$\sigma = \sigma_0$ in a unique manner along a path in the group
manifold, starting in $\sigma_0$. The only question is whether
$\varepsilon(\sigma)$ returns to its original value when we allow
$\sigma$ to describe a closed path. This is to be answered in the
affirmative, since the group manifold of $\mathfrak{u}$ is simply
connected in the sense that any closed curve can be drawn
together into a point by a continuous deformation. For in
accordance with equation (7.5) the elements of the group are
mapped in a one-to-one continuous manner on the quadruple
$(\kappa, \lambda, \mu, \nu)$ of real numbers which are subject to
the condition

\[ \kappa^2 + \lambda^2 + \mu^2 + \nu^2 = 1. \]

Hence the group manifold has the same topological properties
as a 3-dimensional sphere in 4-dimensional space. These
considerations thus show that the n-th root (16.7) is broken up
into n single-valued continuous functions over the entire group
manifold. The method of proof here employed, which is of
fundamental importance in the whole of mathematics, is perhaps
best known to the reader in the proof of Cauchy's integral
theorem; it follows from the fact that the integral of an analytic
function is locally single-valued, that it is single-valued in the
large if the region in which we are operating is simply connected.

The result of our topological considerations showed that
the formula (16.6) defines n single-valued continuous functions
$U(\sigma)$. One of them is such that in it $U(I)$ is the unit
matrix; we henceforth denote it alone by $U(\sigma)$. On writing
the equation

\[ U(\sigma)U(\tau) = \delta(\sigma, \tau)\,U(\sigma\tau) \tag{16.8} \]

for $\tau = I$, and taking into account the fact that $U(I) = 1$,
we find $\delta(\sigma, I) = 1$. On forming the determinant of both
sides of (16.8) we obtain the equation

\[ 1 = [\delta(\sigma, \tau)]^n. \]

$\delta(\sigma, \tau)$ is consequently an n-th root of unity which
depends continuously on $\tau$ for fixed $\sigma$ and which reduces
to 1 for $\tau = I$; hence it is identically equal to 1, and (16.8)
becomes

\[ U(\sigma)U(\tau) = U(\sigma\tau). \]

Consequently the only ray representations of $\mathfrak{u}_2$ are
also vector representations, and our considerations show that this
theorem is valid for any continuous group whose elements
constitute a simply connected manifold. On going over to the
3-dimensional rotation group $\mathfrak{d}_3$ by stereographic
projection, all $\mathfrak{D}_j$, even those with half-integral j,
are single-valued when considered as ray representations. Any
single-valued continuous ray representation of $\mathfrak{d}_3$ is
reducible into irreducible constituents, and the only irreducible
ray representations are the $\mathfrak{D}_j$
($j = 0, 1/2, 1, 3/2, \cdots$) obtained earlier in the chapter.
But $\mathfrak{d}_3$ is not simply connected; we must resort to a
two-sheeted covering surface, similar to a Riemannian surface but
without cuts or branch points, which is simply connected. This
accounts for the fact that there exist irreducible ray
representations of $\mathfrak{d}_3$ which may be single- or
double-valued vector representations, but there cannot exist
multiple-valued representations of higher degree.

I have been able to prove the same theorem for the n-dimensional
rotation group ($n \geq 3$).^^ This means that there exist
two closed continuous motions (i.e. motions which lead back
to the initial state) of a rigid body, which is free to rotate about
a fixed point O, such that any other closed motion can be
continuously deformed into one of the two. One of these may be
taken as rest, and the other is such that it cannot be continuously
deformed into rest.



CHAPTER IV 


APPLICATION OF THE THEORY OF GROUPS
TO QUANTUM MECHANICS 

A. The Rotation Group 

§ 1. The Representation Induced in System Space by 
the Rotation Group 

In accordance with III, § 8, we can interpret the theory of
a single electron in a spherically symmetric electrostatic field,
as developed in II, § 5, in the following manner. A rotation $s$
of physical space, i.e. an orthogonal transformation from the
Cartesian co-ordinates $xyz$ into $x'y'z'$, induces a unitary
transformation $U(s): \psi \to \psi'$ defined by

\[ \psi'(x'y'z') = \psi(xyz) \tag{1.1} \]

in the system-space of the electron, the vectors of which are
the wave functions $\psi(xyz)$ describing the state of the
electron. The correspondence $s \to U(s)$ is a definite
representation $\mathfrak{E}$, of infinitely many dimensions, of
the rotation group $\mathfrak{d}_3$. This representation
$\mathfrak{E}$ can be reduced into its irreducible constituents
$\mathfrak{D}_l$, and it is found that each $\mathfrak{D}_l$ with
integral $l$ occurs an infinite number of times. The total
system-space $\mathfrak{R}$ is correspondingly decomposed into
mutually orthogonal sub-spaces $\mathfrak{R}(nl)$;
$\mathfrak{R}(nl)$ has $2l + 1$ dimensions and the rotation group
induces the representation $\mathfrak{D}_l$ in it. If we introduce
in addition the improper rotations ($\mathfrak{d}_3'$),
$\mathfrak{D}_l$ always appears in $\mathfrak{E}$ with the
signature $(-1)^l$. The ∞-dimensional sub-spaces
$\sum_n \mathfrak{R}(nl)$ associated with the various values of
$l$ are uniquely determined, but their further decomposition into
the summands $\mathfrak{R}(nl)$ is quite arbitrary. In particular,
this can be done in such a way that the energy of the states
composing $\mathfrak{R}(nl)$ has a definite value $E(nl)$.

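We now calculate the operators induced in system-space
by the infinitesimal rotations of physical space. Denoting the
increase $\psi'(xyz) - \psi(xyz)$ by $d\psi$, equation (1.1) becomes

\[
d\psi + \left( \frac{\partial\psi}{\partial x}\,dx
+ \frac{\partial\psi}{\partial y}\,dy
+ \frac{\partial\psi}{\partial z}\,dz \right) = 0
\]

for the infinitesimal rotation $s$ which sends

\[ x, y, z \ \text{ into } \ x' = x + dx, \quad y' = y + dy, \quad z' = z + dz. \]

Taking as $s$ the three infinitesimal rotations $D_x, D_y, D_z$ in
turn [III, (15.4)] and writing the corresponding infinitesimal
unitary operators in the form

\[ d\psi = \frac{1}{i}(L_x, L_y, L_z)\,\psi, \]

we find

\[
L_x = \frac{1}{i}\left( y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y} \right), \quad
L_y = \frac{1}{i}\left( z\frac{\partial}{\partial x} - x\frac{\partial}{\partial z} \right), \quad
L_z = \frac{1}{i}\left( x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x} \right).
\tag{1.2}
\]

$h\mathfrak{L}$ is accordingly the moment of momentum [cf. II, (4.9)].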

On going over from one electron to two, the vectors of system
space are the functions $\psi(x_1y_1z_1;\ x_2y_2z_2)$ of the
Cartesian co-ordinates of both electrons. The unitary
transformation $U: \psi \to \psi'$ induced in system-space by the
rotation $s$ is now defined by the equation

\[ \psi'(x_1'y_1'z_1';\ x_2'y_2'z_2') = \psi(x_1y_1z_1;\ x_2y_2z_2), \]

where $x_1'y_1'z_1'$ and $x_2'y_2'z_2'$ are obtained from
$x_1y_1z_1$ and $x_2y_2z_2$ by the same orthogonal transformation
$s$. This situation can be described as follows: The state space
of the system consisting of two electrons is
$\mathfrak{R} \times \mathfrak{R}$ and the representation induced
in it is $\mathfrak{E} \times \mathfrak{E}$.

This representation is, as we see, determined by the kinematical
constitution of the system alone, and is in no way influenced by
the dynamical relationships; the rule for ×-multiplication for the
induced representation on composition of partial systems
presupposes only kinematical, not dynamical, independence of the
partial systems.

We can, without further trouble, formulate the situation 
discussed above in terms of the general scheme of quantum 
mechanics in a manner which is independent of the particular 
assumptions of Schrödinger's scalar wave theory. This is all
the more important since it has all along seemed doubtful
whether the matter waves could be described in terms of a
single state function $\psi$. We set up an analogy between the
actual displacement of the state of the system in time and the
virtual change produced by an arbitrary rotation of space. The
transition from time $t$ to time $t'$ changes the (arbitrary) state
$\mathfrak{x}$ at time $t$ into a state $\mathfrak{x}'$ at time $t'$
obtained from $\mathfrak{x}$ by a unitary transformation $U$
corresponding to a displacement of the time axis which sends $t$
over into $t'$. The displacements along the time axis constitute a
one-parameter continuous group which is isomorphic with the group
of transformations $U$ associated with them in system-space. The
former group is generated from the infinitesimal displacement
$t \to t + dt$, and it therefore suffices to give the
infinitesimal unitary operator

\[ d\mathfrak{x} = \frac{1}{i} H\mathfrak{x} \]

associated with it in system-space. We called the Hermitian
operator $H$ the energy.

On subjecting the physical system (or the spatial co-ordinate
system in terms of which it is described) to a virtual rotation
$s$, the state $\mathfrak{x}$ goes over into another state
$\mathfrak{x}'$. Since nothing intrinsic to the system is changed
thereby and since the state space is linear and unitary, the
transition $U(s): \mathfrak{x} \to \mathfrak{x}'$ associated with
$s$ must also be linear and unitary. As in the case of the group
of actual displacements in time, this group of virtual rotations
in space must induce a certain representation $\mathfrak{E}$ in the
system space $\mathfrak{R}$; this latter is more properly to be
considered as a ray, rather than a vector, space. But if we go
over from the rotation group to the unimodular unitary group
$\mathfrak{u}_2$ (or $\mathfrak{u}_2'$) by stereographic projection
(III, § 8) and take this latter as fundamental, it is, in
accordance with III, § 16, not necessary to distinguish between
ray and vector representations. The group of proper rotations can
be generated from its infinitesimal operations, and we may take as
a basis for these the infinitesimal rotations $D_x, D_y, D_z$
about the x-, y-, and z-axis. It then suffices to know the
infinitesimal unitary transformations

\[ d\mathfrak{x} = \frac{1}{i}(M_x, M_y, M_z)\,\mathfrak{x} \]

which they induce in system space. We call the real physical
quantities of the system which are represented by the Hermitian
operators $M_x, M_y, M_z$ the x-, y-, z-components of the moment
of momentum $\mathfrak{M}$. In order to express them in terms of
the usual units they must, as was also the case with the energy,
be multiplied by the quantum of action $h$. The moment of
momentum plays the same role with respect to the virtual rotations
of space as the energy with respect to the actual displacements in
time.

One argument for the appropriateness of our definition of moment
of momentum is that in the case of the Schrödinger theory
it leads to the usual formulae of classical mechanics. As a further
justification we prove the general theorem that the moment of
momentum so defined is constant in time. We saw in II, § 8,
that the necessary and sufficient condition that the physical
quantity represented by the Hermitian operator $A$ be constant
in time was that $A$ commute with the Hermitian operator $H$
induced by the infinitesimal displacement of time. In exactly
the same way we can show that the commutativity of $A$ with
$M_x, M_y, M_z$ constitutes the necessary and sufficient condition
that the quantity represented by $A$ remains unaltered under the
virtual proper rotations of space, i.e. that $A$ is a scalar with
respect to these rotations. Now the energy is a scalar, hence

\[ HM_x - M_xH = 0, \ \cdots. \]

But, on the other hand, these equations assert that
$M_x, M_y, M_z$ are constant in time.

The infinitesimal rotations generate only the group of proper
rotations; in order to obtain the complete orthogonal group we
must supplement them with the reflection $i$ in the origin, or
extend the group $\mathfrak{u}_2$ to the group $\mathfrak{u}_2'$
by the addition of the element $\iota$ (III, § 8). $\iota$ will
induce a unitary operator $I$ in system space which commutes with
all $U(s)$, in particular with the moment of momentum
$\mathfrak{M} = (M_x, M_y, M_z)$, and which satisfies the equation
$II = 1$; this shows that $I$ is Hermitian, as well as unitary. A
quantity $A$ which is unchanged by reflection must commute with
$I$; hence, in particular, the energy $H$ must commute with $I$.
The physical quantity represented by $I$, which we call the
signature, is constant in time, as it commutes with $H$. It has,
in common with all quantities arising in group theory which are
not associated with infinitesimal operators, no analogue in
classical mechanics.

We reduce the total system-space into invariant sub-spaces
with respect to the group of displacements in time; such an
invariant sub-space is carried over into itself by the generating
infinitesimal operation $d\mathfrak{x} = \frac{1}{i} H\mathfrak{x}$.
Since we are here dealing with a one-parameter Abelian group, or
with a single operator $H$, this reduction can be carried to the
point in which all the constituent sub-spaces are 1-dimensional.
The states contained in one of these invariant sub-spaces we call
quantum states.

We now proceed in exactly the same manner to reduce the
representation $\mathfrak{E}$ induced in system space by the group
of rotations into its irreducible constituents $\mathfrak{D}_j$.
We make use of the fact that these $\mathfrak{D}_j$ are known to
us a priori; only the number of times they appear in
$\mathfrak{E}$ depends on the particular representation
$\mathfrak{E}$. (Of course, we have not as yet shown that the
$\mathfrak{D}_j$ really constitute a complete system of
irreducible representations of $\mathfrak{d}_3$, and it may seem
risky to apply the process of reduction to the ∞-dimensional
representation $\mathfrak{E}$. This procedure can, however, be
justified on the basis of the fact that $\mathfrak{d}_3$ is a
closed group. But in the final formulation of quantum mechanics it
will not be necessary to base our conclusions on such general
considerations, as the reduction into $\mathfrak{D}_j$ will be
obtained by elementary means.) The entire system-space is thus
decomposed into sub-spaces $\mathfrak{R}_j, \mathfrak{R}_{j'},
\cdots$ such that $\mathfrak{R}_j$ is of dimensionality $2j + 1$
and the representation induced in it by the group
$\mathfrak{u}_2$ is $\mathfrak{D}_j$. On adapting the co-ordinate
system in system-space to this decomposition the variables fall
into classes

\[
x(m) \quad (m = j,\ j - 1,\ \cdots,\ -j); \qquad
x'(m') \quad (m' = j',\ j' - 1,\ \cdots,\ -j'); \qquad \cdots;
\]

under the influence of an arbitrary transformation $\sigma$ of
$\mathfrak{u}_2$, applied to the variables $\xi, \eta$, the
co-ordinates of system-space transform in accordance with the law

\[ x(m) \sim \frac{\xi^i \eta^k}{\sqrt{i!\,k!}} \qquad (i + k = 2j,\ i - k = 2m). \]

With the reduction of $\mathfrak{R}$ or $\mathfrak{E}$ is
associated the reduction of the angular momentum; in the sub-space
$\mathfrak{R}_j$ the components of $\mathfrak{M}$ are given by
III, (15.9), from which it follows that the square $M^2$ of the
moment of momentum has there the fixed value $j(j + 1)$. (It is
evident from general considerations that $M^2$ must be a multiple
of the unit matrix in $\mathfrak{R}_j$, for it is a scalar and
must therefore commute with all the operators of the irreducible
representation $\mathfrak{D}_j$.) If the state of the system is
represented by a vector lying in $\mathfrak{R}_j$, the z-component
of its moment of momentum is capable of assuming the values
$m = j, j - 1, \cdots, -j$; the z-component naturally only
apparently occupies a preferred status, due to the fact that the
co-ordinates in $\mathfrak{R}_j$ were chosen in a manner which
differentiated the z-axis from the others. That $M_z$, $M^2$ can
a priori assume only discrete values $m$, $j(j + 1)$ is
essentially due to the fact that the rotation group is closed;
since the group of displacements in time is open, the analogous
result for the energy need not in general hold. In this connection
we wish to emphasize again that the operator $H$ depends on the
dynamical relationships existing in the system, whereas the
representation $\mathfrak{E}$ induced by the group of rotations
is determined only by the kinematical situation (number of
elementary particles, etc.). The signature $I$ also assumes a
definite one of its values ±1 in each sub-space $\mathfrak{R}_j$.
For lack of a better name we call the states which lie in the
sub-space $\mathfrak{R}_j$, which is invariant under the group of
rotations, "simple" states of inner quantum number $j$. We must
be prepared to find that $j$ may here assume half-integral as well
as integral values, in contrast with the Schrödinger theory.

On uniting two kinematically independent systems, with
system-spaces $\mathfrak{R}, \mathfrak{R}'$ in which the rotation
group induces the representations $\mathfrak{E}, \mathfrak{E}'$,
the total system has as system-space
$\mathfrak{R} \times \mathfrak{R}'$, in which the representation
$\mathfrak{E} \times \mathfrak{E}'$ is induced. In particular, the
moment of momentum of the total system is

\[ (\mathfrak{M} \times 1) + (1 \times \mathfrak{M}') \]

where $\mathfrak{M}$ and $\mathfrak{M}'$ are the angular momenta of
the two partial systems. The theorem that the moment of momentum
behaves additively with respect to composition is contingent only
on the assumption that the parts are kinematically independent,
whereas the corresponding theorem for energy applies only if
they are dynamically independent, i.e. in the absence of
interaction between the parts. This difference is based on the
fact that whereas the energy represents the actual change of state
in the course of time, the moment of momentum represents
the virtual change associated with a fictitious rotation. We
reduce $\mathfrak{R}, \mathfrak{R}'$ into the invariant irreducible
sub-spaces $\mathfrak{R}_j, \mathfrak{R}'_{j'}$ respectively, i.e.
into the simple states of the two partial systems having inner
quantum numbers $j, j'$. The Clebsch-Gordan equation (III, § 5)

\[
\mathfrak{D}_j \times \mathfrak{D}_{j'} =
\mathfrak{D}_{j+j'} + \mathfrak{D}_{j+j'-1} + \cdots + \mathfrak{D}_{|j-j'|}
\tag{1.3}
\]

then tells us: If the two parts are in the simple states with inner
quantum numbers $j, j'$ then the whole has each of the simple
states with inner quantum number

\[ J = j + j',\ j + j' - 1,\ \cdots,\ |j - j'| \tag{1.4} \]

associated with it, each exactly once. To include the signature
we must add: If the parts have as signatures the values
$\delta, \delta'$ ($\delta = \pm 1$), the signature of the whole
has the value $\delta\delta'$.
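
The rule for the inner quantum numbers of the whole can be tabulated mechanically. The sketch below is illustrative only: it lists the values J running from j + j' down to |j − j'| in unit steps and checks that the dimensions 2J + 1 add up to (2j + 1)(2j' + 1), as the decomposition of the product space requires. Exact fractions are used so that half-integral values cause no rounding; the function names are hypothetical.

```python
from fractions import Fraction

def compound_values(j1, j2):
    """Inner quantum numbers J = j1 + j2, j1 + j2 - 1, ..., |j1 - j2|."""
    j1, j2 = Fraction(j1), Fraction(j2)
    values, J = [], j1 + j2
    while J >= abs(j1 - j2):
        values.append(J)
        J -= 1
    return values

def dimension(j):
    """Dimensionality 2j + 1 of the simple space with inner quantum number j."""
    return int(2 * Fraction(j) + 1)

def dimensions_agree(j1, j2):
    """True if the sum of (2J + 1) equals (2 j1 + 1)(2 j2 + 1)."""
    total = sum(dimension(J) for J in compound_values(j1, j2))
    return total == dimension(j1) * dimension(j2)

def compound_signature(delta1, delta2):
    """Signature of the whole from the signatures (+1 or -1) of the parts."""
    return delta1 * delta2
```

For example, `compound_values("3/2", 1)` yields the three values 5/2, 3/2, 1/2, whose dimensions 6, 4, 2 sum to 12 = 4 × 3.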

Compare the results which we have obtained with the 
corresponding results in classical mechanics. In both the moment 
of momentum is constant in time and the moment of momentum 
of the whole is equal to the sum of the moments of momentum 
of the two parts. Denoting the magnitude of the moment of 
momentum in classical theory by j, we have, in agreement with 

(1.4),

\[ |j - j'| \leq J \leq j + j', \]

for the resultant of two vectors of magnitudes $j, j'$ is a vector
whose magnitude $J$ lies within these limits. Quantum mechanics
deviates from classical mechanics in the following three respects:

1. In quantum mechanics the square of the moment of momentum
is $j(j + 1)$, in classical mechanics it is $j^2$;

2. Here $j$ can assume only the discrete values
$0, 1/2, 1, 3/2, \cdots$, there it may have any non-negative
value;

3. Here the $J$ obtained on compounding two partial systems
can assume only those values between $|j - j'|$ and $j + j'$
which differ from them by an integer, there it can assume any
value between these limits.

Already before the rise of the new quantum mechanics a
semi-empirical description of the regularities observed in spectra
had been given with the aid of a vector model consisting of the
vectorial moments of momentum of the individual electrons
and of the atom as a whole; the observations, assisted by the
older quantum mechanics, had already led to these three
modifications of classical theory.^

The reader will perhaps have wondered why we consider 
only the virtual rotations of space and not the translations, 
which must also be taken into account in order to arrive at a 
complete description of the homogeneity of space. The reason 
for this is that in studying atoms or ions we treat only the
electrons as particles, taking the nucleus as a fixed centre of 
force situated in the origin. That this is at least approximately 
correct is due to the fact that the mass of the nucleus is many 
times the mass of the electrons. Space is thereby transformed 
from a homogeneous into a centred space ; such a procedure 
naturally allows us to consider only atoms or ions, which have 
a single nucleus. Diatomic molecules are accordingly described 
with the aid of the 1-parameter group of rotations about the
axis joining the two nuclei, and not by the full 3-parameter 
group of rotations of space — to this we must add reflection in 
the plane which bisects the axis perpendicularly in case the two 
nuclei are physically equivalent.^ If we are dealing with three 
or more fixed nuclei the symmetry either disappears entirely or
is reduced to at most a finite group of rotations.^ 

§ 2. Simple States and Term Analysis. Examples 

To each characteristic value $E'$ of the energy $H$ there belongs
a definite sub-space $\mathfrak{R}'$ of $\mathfrak{R}$, the
sub-space of quantum states with energy level $E'$; it consists of
all states $\mathfrak{x}$ which are transformed into
$E' \cdot \mathfrak{x}$ by the operator $H$ and is accordingly the
characteristic space $\mathfrak{R}(E')$ associated with the
characteristic value $E'$ of $H$. Since the energy is a scalar,
the considerations applied in the preceding paragraph to the total
space $\mathfrak{R}$ can also be applied to $\mathfrak{R}'$:
$\mathfrak{R}'$ is invariant under the operators induced in
system-space by the rotation group and is consequently the
carrier of a certain representation of this group, which can be
reduced into its irreducible constituents. If the energy levels
are of at most finite multiplicity we are faced with the problem
of reducing only representations of finite degree. Accordingly
$\mathfrak{R}$ is decomposed into the "simple spaces"
$\mathfrak{R}_j$ associated with the rotation group in such a way
that not only the square of the angular momentum and the signature
have definite values in $\mathfrak{R}_j$, but also the energy has
a sharply defined value $E_j$. This energy level $E_j$ is
necessarily $(2j + 1)$-fold degenerate; we speak of an accidental
degeneracy when the energy levels of different simple sub-spaces
$\mathfrak{R}_j$ are equal. $M^2$, $I$, $M_z$, and $H$ are all
simultaneously in diagonal form; that this is possible is due to
the fact that these four operators all commute among themselves.
In this way the reduction into simple states can be employed in
term analysis: each energy level $E_j$ possesses an inner quantum
number $j$ which gives the term the natural multiplicity $2j + 1$.

On subjecting the atom to a perturbing field which destroys
its natural spherical symmetry this $(2j + 1)$-fold term is broken
up into $2j + 1$ terms. Let the perturbation, i.e. its Hamiltonian
function $W$, possess axial symmetry about the z-axis; if $E_j$
possesses no accidental degeneracy, then in accordance with the
theory of perturbations the perturbed energy levels are given to
a first approximation by the portion of the Hermitian operator
$W$ in which $\mathfrak{R}_j$ intersects itself:

\[ x(m) \to \sum_{m'} W(m, m')\,x(m') \qquad (m' = j,\ j - 1,\ \cdots,\ -j). \]

The rotation about the z-axis with meridian angle $\varphi$
transforms $x(m)$ into $e(-m\varphi) \cdot x(m)$, and in virtue of
the symmetry assumed for $W$ this correspondence of
$\mathfrak{R}_j$ on itself must also be represented by

\[ e(-m\varphi) \cdot x(m) \to \sum_{m'} W(m, m') \cdot e(-m'\varphi)\,x(m'), \]

or

\[ W(m, m')\,e[(m - m')\varphi] = W(m, m'). \]

But this means that all elements $W(m, m')$ except those in the
main diagonal vanish, whence

\[ E_j + W(m, m) \tag{2.1} \]

are the $2j + 1$ perturbed terms. The quantum number $m$,
which is capable of assuming the values $j, j - 1, \cdots, -j$,
thus serves to label these components. Perhaps the most
important axially symmetric perturbation is that due to a
homogeneous magnetic field in the direction of the z-axis
(Zeeman effect); because of this $m$ is called the magnetic
quantum number. The inner quantum number $j$ of a term
can be determined spectroscopically by counting the number of
terms appearing in the Zeeman effect. Sommerfeld first
concluded, from the spectroscopic data, that $j$ as well as $m$
must be allowed to assume half-integral values. If we consider the
Zeeman effect to be described by the analogue of the classical
formula II, (12.5) then

\[ W = h\,o\,M_z \tag{2.2} \]

and $W$ is rigorously in diagonal form:

\[ W(m, m) = h\,o\,m. \tag{2.3} \]

Our analysis shows that the breaking up of energy levels due
to an axially symmetric perturbation parallels the reduction of
an irreducible representation $\mathfrak{D}_j$ of the rotation
group $\mathfrak{d}_3$ when this is restricted to the group
$\mathfrak{d}_2$ of rotations about the z-axis: by this
$\mathfrak{D}_j$ is reduced into the $2j + 1$ one-dimensional
representations which we have previously denoted by
$\mathfrak{D}^{(m)}$:

\[ x(m) \to e(-m\varphi) \cdot x(m). \]

If two kinematically independent parts, which are in the
simple states $\mathfrak{R}_j$, $\mathfrak{R}'_{j'}$, are compounded
together, the state of the composite system is in the
$(2j + 1)(2j' + 1)$-dimensional product space
$\mathfrak{R}_{jj'} = \mathfrak{R}_j \times \mathfrak{R}'_{j'}$. If the
parts have the energies $E_j$, $E'_{j'}$, then the whole has the
energy $E_{jj'} = E_j + E'_{j'}$, assuming no
interaction between the parts. Introducing a weak interaction
between the two partial systems and assuming that there is no
accidental degeneracy, i.e. assuming that all the remaining
energy levels of the unperturbed system are different from $E_{jj'}$,
it suffices, to a first approximation, to consider the section
$\langle H \rangle$ of the energy operator H in which
$\mathfrak{R}_{jj'}$ intersects itself; it is an Hermitian
correspondence of $\mathfrak{R}_{jj'}$ on itself. We can
apply the considerations, which were applied above to the total
system-space $\mathfrak{R} \times \mathfrak{R}'$, to each of these
$\mathfrak{R}_{jj'}$: $\mathfrak{R}_{jj'}$ is to be decomposed
into sub-spaces belonging to numerically distinct
characteristic values of $\langle H \rangle$. The rotation group induces a
certain representation in each of these sub-spaces, and this
can be further decomposed into its irreducible constituents.
The result is that $\mathfrak{R}_j \times \mathfrak{R}'_{j'}$ is, in
accordance with the Clebsch-Gordan series, reduced into the simple
spaces $\mathfrak{R}_J$, $J = j + j', \; j + j' - 1, \ldots, |j - j'|$,
in such a way that in each of them
the energy $\langle H \rangle$ has a definite value $E_J$. Different $E_J$ can only
"accidentally" have the same numerical value. Consequently
the term $E_{jj'}$ is broken up by the perturbation into terms $E_J$
in exactly the same way as the representation
$\mathfrak{D}_j \times \mathfrak{D}_{j'}$ is
reduced into the irreducible representations $\mathfrak{D}_J$. But this is
only correct to the approximation characteristic of perturbation
theory. As we have seen above, an inner quantum number
J can be rigorously ascribed to a term E; in the approximation
with which we have been dealing here there is associated with
it in addition the inner quantum numbers j, j' of the parts, in
the last analysis of the electrons themselves: the energy level
E arises from a definite term $E_{jj'}$ of the unperturbed system by
interaction of the two parts. Such an association is rigorously
possible for "simple states," but the rules based on it lead only
indirectly and approximately to an analysis of the terms.
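
The Clebsch-Gordan series used in this argument can be checked by a short computation. The following is only a minimal sketch in Python; the function names `clebsch_gordan_series` and `dimension` are illustrative assumptions and do not occur in the text. It lists the values J = j + j', ..., |j - j'| and confirms that the dimensions 2J + 1 of the constituents add up to (2j + 1)(2j' + 1).

```python
from fractions import Fraction


def clebsch_gordan_series(j, j_prime):
    """Return J = j + j', j + j' - 1, ..., |j - j'| as a list of Fractions."""
    j, j_prime = Fraction(j), Fraction(j_prime)
    lowest = abs(j - j_prime)
    J = j + j_prime
    values = []
    while J >= lowest:
        values.append(J)
        J -= 1
    return values


def dimension(j):
    """Dimension 2j + 1 of the irreducible representation with index j."""
    return int(2 * Fraction(j) + 1)


# Illustration (values chosen arbitrarily): j = 3/2 compounded with j' = 1.
j, j_prime = Fraction(3, 2), Fraction(1)
series = clebsch_gordan_series(j, j_prime)
print([str(J) for J in series])

# The constituents must fill the whole (2j + 1)(2j' + 1)-dimensional space.
assert sum(dimension(J) for J in series) == dimension(j) * dimension(j_prime)
```

Exact fractions are used so that half-integral values of j are handled without rounding.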

Examples 

If we take the Schrödinger scalar wave theory to be valid
for a single electron, then a simple quantum state of the electron
in the field of the nucleus is characterized by the principal
quantum number n and the azimuthal quantum number l (we
here use the word "azimuthal" instead of "inner"). Such
a term is $(2l + 1)$-fold degenerate, and we assume there is no
further accidental degeneration. The moment of momentum
is represented by the operator $\mathfrak{L}$ taken over from classical
theory; the square of its absolute magnitude is $l(l + 1)$ and
the signature has the value $(-1)^l$. If f electrons come together
to form an atom we obtain a term, neglecting interaction between
the electrons,

    $E(n_1 l_1) + E(n_2 l_2) + \cdots + E(n_f l_f)$    (2.4)

of multiplicity $(2l_1 + 1) \cdots (2l_f + 1)$. The quantum numbers
n and l refer to the individual electrons. The interaction causes
a separation which parallels the complete reduction, obtained
with the aid of the Clebsch-Gordan series, of

    $\mathfrak{D}_{l_1} \times \mathfrak{D}_{l_2} \times \cdots \times \mathfrak{D}_{l_f}$    (2.5)

into its irreducible constituents $\mathfrak{D}_L$ with total azimuthal quantum
number L. Each such term is associated with the quantum
numbers

    $(n_1 l_1, \; n_2 l_2, \; \ldots, \; n_f l_f; \; L)$.    (2.6)

If $f \geq 3$ certain $\mathfrak{D}_L$ appear more than once in (2.5), and we may
therefore have several $(2L + 1)$-fold terms associated with the
same set (2.6); these must then be distinguished from each
other by some further index. The square of the total moment
of momentum is $L(L + 1)$ and the signature $(-1)^{l_1 + l_2 + \cdots + l_f}$.
In spectroscopy it is usual to characterize the values l = 0, 1, 2, 3,
4, ... by the small Latin letters s, p, d, f, ... and the values
L = 0, 1, 2, 3, ... by the corresponding capitals S, P, D, F, ....
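
The remark that for three or more electrons certain values of L occur more than once in (2.5) can be illustrated by repeated use of the Clebsch-Gordan series. The sketch below is an assumption-laden illustration, not part of the original treatment: the names `couple` and `total_L_multiplicities` are hypothetical, and no account is taken of any restriction beyond the pure reduction of the product representation.

```python
from collections import Counter


def couple(multiplicities, l):
    """Couple every D_L listed in `multiplicities` with D_l (Clebsch-Gordan series)."""
    result = Counter()
    for L, count in multiplicities.items():
        for L_new in range(abs(L - l), L + l + 1):
            result[L_new] += count
    return result


def total_L_multiplicities(ls):
    """Multiplicity with which each D_L occurs in D_{l_1} x ... x D_{l_f}."""
    multiplicities = Counter({0: 1})  # D_0 is the identity representation
    for l in ls:
        multiplicities = couple(multiplicities, l)
    return dict(sorted(multiplicities.items()))


LETTERS = "SPDF"  # capitals for L = 0, 1, 2, 3 as given in the text


def letter(L):
    """Spectroscopic capital for L, falling back to the number beyond F."""
    return LETTERS[L] if L < len(LETTERS) else str(L)


# Illustration: three p electrons (l = 1 each).
three_p = total_L_multiplicities([1, 1, 1])
print({letter(L): n for L, n in three_p.items()})

# The dimensions must add up to (2l_1 + 1)(2l_2 + 1)(2l_3 + 1) = 27.
assert sum((2 * L + 1) * n for L, n in three_p.items()) == 27
```

For three p electrons the P and D constituents each occur more than once, which is the situation in which a further index is needed to distinguish the terms.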

We cannot expect the scalar wave theory to be correct,
but must be prepared to describe the state of the wave field
in terms of a quantity $\psi$ with several, say a, components
$(\psi_1, \psi_2, \ldots, \psi_a)$, i.e. by a covariant quantity of a definite kind
$\mathfrak{A}$. Each component is a function of the spatial co-ordinates
xyz; the components will depend on the choice of the Cartesian
co-ordinate system in such a way that on going over to a new
co-ordinate system by the rotation s the components will undergo
among themselves that transformation A(s) which corresponds
to s in the representation $\mathfrak{A}$. Again, consider $\mathfrak{d}_3$ replaced by
$\mathfrak{u}_2$ as the fundamental group. The general component $\psi_\rho(xyz)$ of
the "vector" $\psi$ has two indices, the index $\rho$ running from 1 to a
and the index (xyz) running through all the points of space.
Let $\mathfrak{R}_t$ be the vector space of functions $\psi(xyz)$ and $\mathfrak{R}_\rho$ the
a-dimensional vector space; the state space of a single electron
is then $\mathfrak{R}_\rho \times \mathfrak{R}_t$. Under the influence of the rotation s which
sends xyz into x'y'z' the state $\psi$ goes over into the state $\psi'$
defined by the equation

    $\psi'_\rho(x'y'z') = \sum_\sigma a_{\rho\sigma} \, \psi_\sigma(xyz), \qquad \| a_{\rho\sigma} \| = A(s)$;

the representation induced in system-space is accordingly the
product of $\mathfrak{A}$ with the representation which the rotations induce
in the space $\mathfrak{R}_t$ of scalar functions. The moment of momentum
$\mathfrak{M}$ of the electron consists of two parts:

    $\mathfrak{M} = (\mathfrak{S} \times 1) + (1 \times \mathfrak{L})$,    (2.7)

the first of which refers to the a-dimensional "spin space" $\mathfrak{R}_\rho$,
the second to the "translation space" $\mathfrak{R}_t$. $(1 \times L_x)$, or simply
$L_x$, is the operator which acts on each of the
a components in the same way; it affects only the index (xyz),
leaving the index $\rho$ unaltered. $S_x$ is the unitary transformation
corresponding to the infinitesimal rotation about the x-axis in
the representation $\mathfrak{A}$; $(S_x \times 1)$, or simply $S_x$, consequently
affects only the index $\rho$ and leaves (xyz) unchanged. Only
the part $\mathfrak{L}$ appears in classical mechanics; we call it the orbital
moment of momentum, and the remaining part $\mathfrak{S}$ the spin
moment of momentum, or simply the spin. Its appearance
is unavoidable so long as the wave quantity $\psi$ is not simply a
scalar or a set of scalars. Each of the two parts satisfies
separately the commutation rules III, (15.7), but in general only the
total angular momentum satisfies the law of conservation. If
the quantity $\psi$ is of a simple kind, i.e. if $\mathfrak{A}$ is an irreducible
representation $\mathfrak{D}_s$, then $a = 2s + 1$ and the spin $\mathfrak{S}$ is equal to
the moment of momentum $\mathfrak{M}_s$ associated with the representation
$\mathfrak{D}_s$.

Since the Schrödinger theory has proved itself at least
approximately correct, one should assume that to a first
approximation each of the components $\psi_\rho$ satisfies the Schrödinger
scalar wave equation. So long as we consider this approximation,
the a components have only the effect of multiplying the
multiplicity of each energy level by a. But in reality the correct
differential equations must contain a term, the "spin
perturbation," which introduces a coupling between the various
components $\psi_\rho$. The electron can thus be considered in
abstracto as a composite system, consisting of the electron
translation with system-space $\mathfrak{R}_t$ and the electron spin
with system-space $\mathfrak{R}_\rho$; the spin perturbation is the weak
interaction between these two. Because of this the method of
composition can here be applied. Let $\mathfrak{A} = \mathfrak{D}_s$. Decompose the
translation space into the $(2l + 1)$-dimensional sub-spaces
$\mathfrak{R}(nl)$; the corresponding energy term $E(nl)$ with azimuthal
quantum number l has, on neglecting the spin perturbation, the
multiplicity $a(2l + 1)$ and its characteristic space is the space
$\mathfrak{R}_\rho \times \mathfrak{R}(nl)$ of the same dimensionality. On taking the first
order spin perturbation into account this term is separated
into the terms $E_j$ with inner quantum number j and multiplicity
$(2j + 1)$ in a manner paralleling the decomposition of the
representation $\mathfrak{D}_s \times \mathfrak{D}_l$ into its irreducible constituents:

    $\mathfrak{D}_s \times \mathfrak{D}_l = \sum_j \mathfrak{D}_j, \qquad j = s + l, \; s + l - 1, \ldots, |l - s|$,    (2.8)

with the aid of the Clebsch-Gordan series. Care must be taken
to differentiate sharply between the azimuthal and inner quantum
numbers l and j. The latter is capable of assuming the values
given in (2.8); whenever $l \geq s$ the number of different terms in
such a "multiplet" is $2s + 1$. $\mathfrak{L}^2$ is approximately equal to
the constant $l(l + 1)$, $\mathfrak{S}^2$ is approximately equal to the constant
$s(s + 1)$, and $\mathfrak{M}^2$ is rigorously constant and exactly equal to
$j(j + 1)$. We can thus speak of the azimuthal quantum number
of an actual energy term only to within the approximation
characteristic of perturbation theory. It is well to set forth
these considerations beforehand and to approach the
spectroscopic data, as we shall in § 4, with them well in mind.

§ 3. Selection and Intensity Rules

We return to the consideration of our system as a whole,
without resolving it into its individual electrons, and again
denote the total inner quantum number by j. Let A be any
physical quantity of the system, and let it be represented by
the Hermitian form A; we write that portion of this form in
which $\mathfrak{R}_j$ intersects $\mathfrak{R}'_{j'}$ in the form

    $\sum a(mm') \, \bar{x}(m) \, x'(m')$,    (3.1)

where the indices m, m' run through the values

    $m = j, \; j - 1, \ldots, -j; \qquad m' = j', \; j' - 1, \ldots, -j'$.    (3.2)

If the quantity A is a scalar, the operator A commutes with the
operators U(s) induced in system-space by the rotations s.
On decomposition into these irreducible sub-spaces $\mathfrak{R}_j$ it
follows from the fundamental theorem III, (10.5), of the theory
of representations that the section (3.1) of A corresponding to
the transition $\mathfrak{R}_j \to \mathfrak{R}'_{j'}$ is zero if $j' \neq j$ and a multiple of the
$(2j + 1)$-dimensional unit form

    $\sum_m \bar{x}(m) \, x'(m)$

if $j' = j$.

An analogous situation exists for the group $\mathfrak{d}_2$ of rotations
about the z-axis. With respect to it the total system space
decomposes into 1-dimensional invariant sub-spaces $\mathfrak{R}(m)$ in
which the rotation with angle $\varphi$ induces the representations
$x(m) \to e(-m\varphi) \, x(m)$. If we only assume that the physical
quantity A possesses axial symmetry about the z-axis it follows
that the coefficient $a(mm')$ is necessarily zero when the magnetic
quantum numbers m and m' of the initial and final states are
different.

We now consider a vectorial quantity q with the three
components $q_x$, $q_y$, $q_z$ instead of the scalar quantity A. This
is of particular importance because such a quantity, i.e. the
electric dipole moment q of the atom, determines the interaction
between the atom and radiation, to that approximation in
which the linear dimensions of the atom may be neglected in
comparison with the wave-length of the emitted light. If the
degeneracy of the energy level $E_j$ is destroyed by an external
axially symmetric perturbation, e.g. a homogeneous magnetic
field in the direction of the z-axis, then the spectral line caused
by the transition $\mathfrak{R}_j \to \mathfrak{R}'_{j'}$ from the term $E_j$ to $E'_{j'}$ is broken
up into the lines associated with all possible transitions
$(\mathfrak{R}_j, m) \to (\mathfrak{R}'_{j'}, m')$. On calculating the part of the Hermitian
form representing the electric dipole moment in which the
sub-space $\mathfrak{R}_j$ intersects $\mathfrak{R}'_{j'}$:

    $\sum q(mm') \, \bar{x}(m) \, x'(m')$,    (3.3)

the ratios of the squares of the absolute values $|q(mm')|^2$ of its
coefficients determine the relative intensities of these $(2j + 1)(2j' + 1)$
lines. Since $q_z$ is axially symmetric about the z-axis, $q_z(mm') = 0$
unless $m' = m$; we thus have the selection rule

    $q_z: \; m \to m$    (3.4)

for the z-component of the electric moment. On performing
the rotation with angle $\varphi$ about the z-axis, $x(m)$, $q_x + iq_y$, $q_x - iq_y$
are multiplied by $e(-m\varphi)$, $e(\varphi)$, $e(-\varphi)$ respectively. Since
$\bar{x}(m) x'(m')$ is therefore multiplied by $e[(m - m')\varphi]$ we obtain
the selection rules

    $q_x + iq_y: \; m \to m - 1, \qquad q_x - iq_y: \; m \to m + 1$    (3.4')

for the x- and y-components of q. Only the transitions

    $m \to m - 1, \; m, \; m + 1$    (3.5)

of the magnetic quantum number are allowed; the first and the
last generate two waves which are circularly polarized in the
xy-plane in opposite directions, and the remaining transition $m \to m$
generates a wave which is linearly polarized in the z-direction.
If the equation (2.3) holds for Zeeman effect, the wave number
of the component $m \to m'$ is displaced by an amount $o(m - m')$
from its unperturbed value. Thus in "normal Zeeman effect"
we obtain instead of $(2j + 1)(2j' + 1)$ components only three,
whose polarization is as described above and whose wave numbers
are displaced by the amounts 0, $\pm o$. That the resolution of
the two terms $E_j$, $E'_{j'}$ is almost entirely hidden is due to the
fact that the factor of proportionality $ho$ in (2.3) has the same
value for both terms. Fortunately most of the cases actually
observed show "anomalous Zeeman effect," in which the
resolution of the terms can be seen clearly; in order to explain it
we must change the expression (2.2) for the perturbation due
to the magnetic field. But the above selection rule for the
magnetic quantum number, which has been obtained from
fundamental principles of group theory, is valid in all cases.
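
The counting behind the normal Zeeman effect can be made concrete. The following sketch (the helper names are hypothetical and chosen only for illustration) enumerates the transitions permitted by (3.5) and, assuming that (2.3) holds with the same factor for both terms, collects the distinct displacements $m - m'$ in units of o.

```python
from fractions import Fraction


def magnetic_values(j):
    """Return m = j, j - 1, ..., -j."""
    j = Fraction(j)
    return [j - k for k in range(int(2 * j) + 1)]


def allowed_transitions(j, j_prime):
    """Pairs (m, m') permitted by the selection rule m -> m - 1, m, m + 1."""
    return [
        (m, m_prime)
        for m in magnetic_values(j)
        for m_prime in magnetic_values(j_prime)
        if abs(m - m_prime) <= 1
    ]


def normal_zeeman_displacements(j, j_prime):
    """Distinct displacements m - m' (in units of o), assuming (2.3) for both terms."""
    return sorted({m - m_prime for m, m_prime in allowed_transitions(j, j_prime)})


# Illustration with arbitrarily chosen terms j = 2 and j' = 1.
pairs = allowed_transitions(2, 1)
print(len(pairs), "allowed transitions out of",
      len(magnetic_values(2)) * len(magnetic_values(1)), "pairs (m, m')")
print([str(d) for d in normal_zeeman_displacements(2, 1)])
```

However many transitions the selection rule admits, only three distinct displacements survive under this assumption, which is why the resolution of the two terms themselves stays hidden.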

The selection rule for the inner quantum number j is obtained
in an analogous manner. The three components $q_x$, $q_y$, $q_z$ of q
suffer the transformation s among themselves when the $x(m)$,
$x'(m')$ are subjected to the transformations corresponding to
s in the representations $\mathfrak{D}_j$, $\mathfrak{D}_{j'}$ respectively. Or, if we wish to
express it in terms of $\mathfrak{u}_2$ instead of $\mathfrak{d}_3$, s is that transformation
which is associated with the element $\sigma$ of $\mathfrak{u}_2$ in the representation
$\mathfrak{D}_1$. This is, of course, merely an expression of the fact that q
is a vector. Now, in accordance with the terminology
introduced in III, § 14, (3.3) is a vectorial quantity in the
representation space of $\bar{\mathfrak{D}}_j \times \mathfrak{D}_{j'}$ and we are interested in determining
how many linearly independent quantities of this kind there
are. Their number is given by the number of times $\mathfrak{D}_1$ is
contained in $\bar{\mathfrak{D}}_j \times \mathfrak{D}_{j'}$ or $\mathfrak{D}_j \times \mathfrak{D}_{j'}$ as an irreducible constituent.
But in accordance with (1.3) $\mathfrak{D}_1$ occurs in $\mathfrak{D}_j \times \mathfrak{D}_{j'}$ exactly
once if

    $j' = j - 1$ or $j$ or $j + 1$

and otherwise not at all, and we must further exclude the case
$j = j' = 0$. We thus obtain the selection rule

    $j \to j - 1, \; j, \; j + 1$    (3.6)

with the proviso that $0 \to 0$ does not occur. Since there exists
but one linearly independent vectorial quantity in the
representation space of $\mathfrak{D}_j \times \mathfrak{D}_{j'}$ in the cases in which the selection
rule is satisfied, the components of $q(m, m')$ are determined by
purely group-theoretic considerations to within a constant factor
of proportionality.

In order to calculate the vectorial quantity (3.3) for $j' = j - 1$
we proceed as follows. Let $(\xi, \eta)$, $(\xi', \eta')$ be two arbitrary points
on the unit sphere which transform cogrediently under $\mathfrak{u}_2$.
Then $\bar\xi \xi' + \bar\eta \eta'$ is the fundamental invariant, and the three
forms which are obtained from

    $\frac{1}{k!} (\bar\xi \xi' + \bar\eta \eta')^k$    (3.7)

by multiplication with

    $-\bar\xi^2, \qquad \bar\eta^2, \qquad \bar\xi \bar\eta$    (3.8)

transform in the same way as the $(x + iy)$-, $(x - iy)$-, z-components
of a vector, respectively. They are linear in the
monomials $\bar\xi^r \bar\eta^s$ of degree $k + 2 = 2j$ and in the monomials
$\xi'^{r'} \eta'^{s'}$ of degree $k = 2j'$. Introducing

    $x(m) = \frac{\xi^r \eta^s}{\sqrt{r! \, s!}} \qquad (r + s = k + 2, \; 2m = r - s)$,

    $x'(m') = \frac{\xi'^{r'} \eta'^{s'}}{\sqrt{r'! \, s'!}} \qquad (r' + s' = k, \; 2m' = r' - s')$

as co-ordinates in the representation spaces of $\mathfrak{D}_j$, $\mathfrak{D}_{j'}$ we find
that the three forms above are of the type (3.3) with $j' = j - 1$.
For example, we obtain for the $(x + iy)$-component

    $-\frac{\bar\xi^2}{k!} (\bar\xi \xi' + \bar\eta \eta')^k = -\sum_{(r-2)+s=k} \frac{\bar\xi^r \bar\eta^s \, \xi'^{r-2} \eta'^s}{(r - 2)! \, s!}$

    $= -\sum \frac{\bar\xi^r \bar\eta^s}{\sqrt{r! \, s!}} \cdot \frac{\xi'^{r-2} \eta'^s}{\sqrt{(r - 2)! \, s!}} \cdot \sqrt{r(r - 1)}$

    $= -\sum_m \sqrt{(j + m)(j + m - 1)} \; \bar{x}(m) \, x'(m - 1)$.

In agreement with the selection rule $m \to m - 1$ there occur here
only those terms for which $m' = m - 1$. Calculating the
$(x - iy)$- and z-components in the same way, we find for the
transition

$j \to j' = j - 1$:

    $(q_x + iq_y)(m, m - 1) = -\sqrt{(j + m)(j + m - 1)}$,

    $(q_x - iq_y)(m, m + 1) = \sqrt{(j - m)(j - m - 1)}$,    (3.9)

    $q_z(m, m) = \sqrt{(j + m)(j - m)}$.

In order to calculate the components for the transition $j \to j' = j$
we must replace the factors (3.8) by

    $2\bar\xi \eta', \qquad 2\bar\eta \xi', \qquad \bar\xi \xi' - \bar\eta \eta'$,

which also transform like the $(x + iy)$-, $(x - iy)$- and z-components
of a vector. Finally, for the transition $j' = j + 1$ we
must replace (3.8) by $\eta'^2$, $-\xi'^2$, $\xi' \eta'$. Since the angular
momentum $\mathfrak{M}$ is a vector, the formulae for the transition $j \to j$
must naturally agree with those already obtained for $\mathfrak{M}$ [III,
(15.9)], and since q is Hermitian the formulae for the transition
$j \to j + 1$ must agree with those obtained by taking the
Hermitian conjugate of the components for the transition
$j \to j - 1$.

$j \to j' = j$:

    $(q_x + iq_y)(m, m - 1) = \sqrt{(j + m)(j - m + 1)}$,

    $(q_x - iq_y)(m, m + 1) = \sqrt{(j - m)(j + m + 1)}$,    (3.9)

    $q_z(m, m) = m$.

$j \to j' = j + 1$:

    $(q_x + iq_y)(m, m - 1) = \sqrt{(j - m + 1)(j - m + 2)}$,

    $(q_x - iq_y)(m, m + 1) = -\sqrt{(j + m + 1)(j + m + 2)}$,    (3.9)

    $q_z(m, m) = \sqrt{(j + m + 1)(j - m + 1)}$.

In each of these three sets of formulae the right-hand sides are
determinate only to within a common factor of proportionality
which is independent of m, but which can be completely
determined only by integrating the wave equation of the dynamic
model of the atom, and not by the theory of groups alone.
The coefficients which do not occur explicitly in the above
formulae are all null. The squares of the absolute values of these
coefficients yield the (rational!) intensity ratios of the components
into which a line is split by the perturbation.
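
The intensity ratios contained in (3.9) can be tabulated directly. The sketch below is an illustration only: the function names are assumptions, the common factor of proportionality is set equal to 1, and the final check (that $\frac{1}{2}(|q_x + iq_y|^2 + |q_x - iq_y|^2) + |q_z|^2$ is the same for every initial m) is a consistency test added here rather than a statement taken from the text.

```python
from fractions import Fraction


def intensities(j, j_prime, m):
    """Squares of the coefficients (3.9) for the initial state m.

    Keys name the final magnetic quantum number: "m-1" belongs to
    q_x + i q_y, "m+1" to q_x - i q_y and "m" to q_z. The common
    factor left open by group theory is taken to be 1 (an assumption).
    """
    j, m = Fraction(j), Fraction(m)
    if abs(m) > j:
        raise ValueError("m must lie between -j and j")
    step = Fraction(j_prime) - j
    if step == -1:
        plus, minus, z = (j + m) * (j + m - 1), (j - m) * (j - m - 1), (j + m) * (j - m)
    elif step == 0:
        plus, minus, z = (j + m) * (j - m + 1), (j - m) * (j + m + 1), m * m
    elif step == 1:
        plus = (j - m + 1) * (j - m + 2)
        minus = (j + m + 1) * (j + m + 2)
        z = (j + m + 1) * (j - m + 1)
    else:
        raise ValueError("selection rule (3.6) requires j' = j - 1, j or j + 1")
    return {"m-1": plus, "m": z, "m+1": minus}


def total_strength(j, j_prime, m):
    """Sum of |q_x|^2 + |q_y|^2 + |q_z|^2 over all final states m'."""
    parts = intensities(j, j_prime, m)
    return Fraction(1, 2) * (parts["m-1"] + parts["m+1"]) + parts["m"]


# Illustration: the line j = 1 -> j' = 0 in a magnetic field.
for m in (1, 0, -1):
    print(m, intensities(1, 0, m), total_strength(1, 0, m))
```

All values are exact rationals, in keeping with the remark that the intensity ratios are rational.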

Already before the rise of the new quantum mechanics the
intensity formulae (3.9) for the components of a line emitted
under the influence of a magnetic field were obtained from the
observational data under the guidance of the correspondence
principle. In the new quantum mechanics they are, as we
have seen, a consequence of the most general principles, and we
would find ourselves in serious difficulties if they were incorrect.
Nevertheless it is to be remembered that they can be invalid
(1) if the spherical symmetry of the system is destroyed by
external perturbing fields, or (2) if for short wave-lengths the
interaction between matter and radiation is no longer determined
primarily by the electric dipole moment.

Since the dipole moment is a proper vector, as the components
$q_x$, $q_y$, $q_z$ go over into $-q_x$, $-q_y$, $-q_z$ on reflection i in the
origin, the representation induced on them by i has as
signature $-1$. If the signatures of $\mathfrak{R}_j$, $\mathfrak{R}'_{j'}$ are $\delta$, $\delta'$, then under
the influence of the reflection i (3.3) is multiplied by the factor
$\delta\delta'$. The coefficients $q(mm')$ must accordingly all vanish unless
$\delta\delta' = -1$; the selection rule for the signature is

    $\delta \to -\delta$.

If the individual electrons are governed by the scalar wave
theory the total azimuthal quantum number L of the atom
can jump only to $L - 1$, L or $L + 1$, while the sum of the
azimuthal quantum numbers of the individual electrons $l_1 + l_2 + \cdots + l_f$
can change only by an odd integer (Laporte's rule). In the case
of a single electron, f = 1, only the transitions $l \to l \pm 1$ are
consistent with these rules; this result has already been obtained
in II, § 5, from the theory of spherical harmonics.

The formulae (3.9) allow us to solve a problem which we shall
here, for the sake of future application, introduce from the
physical standpoint. A partial system in the simple state $\mathfrak{R}_j$
is compounded with a second in the simple state $\mathfrak{R}'_{j'}$ to form
a single system. In $\mathfrak{R}_{jj'} = \mathfrak{R}_j \times \mathfrak{R}'_{j'}$, $\mathfrak{u}_2$ induces the
representation $\mathfrak{D} = \mathfrak{D}_j \times \mathfrak{D}_{j'}$; let the corresponding moment of
momentum be $\mathfrak{M}$. On adapting the normal co-ordinate system
in $\mathfrak{R}_{jj'}$ to the complete reduction of $\mathfrak{D}$ into its irreducible
constituents $\mathfrak{D}_J$, $\mathfrak{M}$ is broken up into square sub-matrices $\mathfrak{M}_J$ of
length $2J + 1$, arranged along the principal diagonal,
corresponding to the decomposition of $\mathfrak{R}_{jj'}$ into sub-spaces $\mathfrak{R}_J$. But
the same is not true of the moment of momentum $\mathfrak{M}_j \times 1$ of
the first partial system, and we wish to determine the portion
of this matrix in which $\mathfrak{R}_J$ intersects itself. That is, in physical
language, we wish to determine the temporal mean value
of the moment of momentum of the first system in the state
defined by the quantum numbers j, j'; J of the two parts and
the whole. We assume that the interaction between the two
parts resolves the energy level $E_{jj'}$ into distinct levels $E_J$ on
applying the theory of perturbations. Since $\mathfrak{M}_j \times 1$ is a vector we
know, from the same considerations as we applied to the electric
dipole moment above, that the portion of it corresponding to
the transition $J \to J$ must be a multiple of $\mathfrak{M}_J$:

    $\langle \mathfrak{M}_j \times 1 \rangle_J = \kappa \, \mathfrak{M}_J$.    (3.10)

In order to evaluate the proportionality factor $\kappa$ we construct
the scalar product of the matrices $(\mathfrak{M}_j \times 1)$ and $\mathfrak{M}$; since

    $\mathfrak{M} = (\mathfrak{M}_j \times 1) + (1 \times \mathfrak{M}_{j'})$

these two matrices commute and we have

    $(1 \times \mathfrak{M}_{j'})^2 = \mathfrak{M}^2 + (\mathfrak{M}_j \times 1)^2 - 2\mathfrak{M}(\mathfrak{M}_j \times 1)$

or

    $2\mathfrak{M}(\mathfrak{M}_j \times 1) = j(j + 1) - j'(j' + 1) + \mathfrak{M}^2$;    (3.11)

for since in the original co-ordinate system $(\mathfrak{M}_j \times 1)^2$ was
$j(j + 1)$ times the unit matrix, it remains the same in the new
co-ordinates. And, on the other hand, $\mathfrak{M}(\mathfrak{M}_j \times 1)$ is equal
to $\kappa \cdot J(J + 1)$ times the unit matrix in the sub-space $\mathfrak{R}_J$, as
follows from (3.10). Hence from (3.11)

    $2\kappa \, J(J + 1) = j(j + 1) - j'(j' + 1) + J(J + 1)$,

    $\kappa = \frac{1}{2} + \frac{j(j + 1) - j'(j' + 1)}{2J(J + 1)}$.    (3.12)
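
Formula (3.12) is easily evaluated with exact fractions. The following sketch uses a hypothetical function name `kappa`; the case J = 0, in which $\mathfrak{M}_J$ vanishes and the factor is undefined, is rejected explicitly, which is a design choice of this illustration. Since the two partial moments add up to $\mathfrak{M}$, the factors for the first and second part must sum to 1, and this serves as a check.

```python
from fractions import Fraction


def kappa(j, j_prime, J):
    """Proportionality factor of (3.12): <M_j x 1>_J = kappa * M_J."""
    j, j_prime, J = Fraction(j), Fraction(j_prime), Fraction(J)
    if J == 0:
        raise ValueError("M_J vanishes for J = 0, so kappa is undefined")
    return Fraction(1, 2) + (j * (j + 1) - j_prime * (j_prime + 1)) / (2 * J * (J + 1))


# Illustration: a part with j = 1/2 compounded with a part with j' = 1, whole J = 3/2.
first = kappa(Fraction(1, 2), 1, Fraction(3, 2))
second = kappa(1, Fraction(1, 2), Fraction(3, 2))
print(first, second)

# The mean values of the two parts together give the whole moment of momentum.
assert first + second == 1
```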


§ 4. The Spinning Electron, Multiplet Structure and
Anomalous Zeeman Effect

We have hitherto ignored the fact that the terms of the
alkali spectra, characterized by the two quantum numbers n, l,
are in reality not simple. Each of these terms (with the
exception of the s terms, l = 0) actually consists of a fine doublet.
By § 2 the (n, l) term should be resolved into $2l + 1$ components
in a magnetic field; instead we find that one of the doublet
terms breaks up into $2l$ components and the other into $2l + 2$.
We should accordingly ascribe to them the inner quantum
numbers $j = l - \frac{1}{2}$, $j = l + \frac{1}{2}$, respectively.


Our general considerations immediately give us a hint as
to how this discrepancy is to be explained. The quantity $\psi$
describing the wave field is not a scalar, but is instead a covariant
quantity of the kind $\mathfrak{D}_{1/2}$, having two components $(\psi_1, \psi_2)$. This
is the theory of doublet phenomena as developed by W. Pauli.
It seems indeed easy to arrive at this conclusion after the
preparation of the preceding paragraphs, but historically this
systematic foundation was developed only after Pauli's
discovery. It is quite immaterial whether we associate the matrix
$+1$ or the matrix $-1$ with the element i in the representation
$\mathfrak{D}_{1/2}$ of $\mathfrak{u}_2$. Taking the first of these alternatives, the signature
has the value $(-1)^l$ in the quantum state (nlj); hence Laporte's
rule remains rigorously correct on taking the spin into account.
We have as further rigorous selection rules those concerning
the total inner and the total magnetic quantum numbers. In
the representation $\mathfrak{D}_{1/2}$ the transformation $\sigma$ itself corresponds
to the element $\sigma$ of $\mathfrak{u}_2$, and by III, (15.6), the spin moment of
momentum is $\frac{1}{2}\mathfrak{S}$, where $\mathfrak{S}$ is the vector already defined with
components

    $S_x = \begin{Vmatrix} 0 & 1 \\ 1 & 0 \end{Vmatrix}, \qquad S_y = \begin{Vmatrix} 0 & -i \\ i & 0 \end{Vmatrix}, \qquad S_z = \begin{Vmatrix} 1 & 0 \\ 0 & -1 \end{Vmatrix}$.


We shall not as yet attempt to find the specific effect of the
spin perturbation on the wave equation. This was done
originally by picturing the electron as a small material sphere, the
rotation of which gave rise to the spin; the additional moment
of momentum required by spectroscopic observations was first
introduced in this way by Goudsmit and Uhlenbeck. Since
$S_z$ is capable of assuming only the values $\pm 1$ it appears as if
the spin axis can only be quantized along the positive or negative
z-axis; we need not go into the false conclusions this assertion
can lead to on interpreting it literally. The spin perturbation
must appear in going over from classical to relativistic mechanics.
The terms of the hydrogen atom, calculated in accordance with
the scalar non-relativistic wave mechanics, depend only on the
principal quantum number n, but the theory of relativity
introduces a correction which causes the terms corresponding to the
various values of l to split apart and form the so-called fine
structure. We should therefore expect the same scheme of
terms in hydrogen as in the alkalies, but observation shows
that the doublet separation of an l term into two terms with
$j = l \pm \frac{1}{2}$ is just such that two terms with the same j, but with
different $l = j \pm \frac{1}{2}$, exactly coincide. Hence the spin
perturbation in hydrogen agrees quantitatively with the separation
caused by the relativity correction.

The alkali doublets show anomalous Zeeman effect. Other
elements, such as alkaline earth metals, have (in addition to
triplets) a system of singlet terms, and singlet terms always
show normal Zeeman effect in a magnetic field. It therefore
seems probable that the anomalies in Zeeman effect are closely
connected with the spin. The magnetic separation of an alkali
term is quite independent of the principal quantum number n;
all the terms of a series behave in the same way. A term (l, j)
splits up into $2j + 1$ equi-distant components, characterized by
the magnetic quantum number m, but their separation is $hog$
instead of $ho$, where g is a rational function of l and j (the "Landé
g-factor"). The energy value of the component m is therefore
displaced by an amount

    $hog \cdot m \qquad (m = j, \; j - 1, \ldots, -j)$    (4.1)

from its unperturbed value. The empirical formula for the factor
g, which is due to Landé, is

    $g = \frac{2j + 1}{2l + 1}$.    (4.2)

This formula holds for weak magnetic fields, in which the
separation is of a smaller order of magnitude than the doublet separation.
If $l = 0$, $j = \frac{1}{2}$, we have in particular $g = 2$.

This latter fact gives a hint toward the solution of the puzzle:
If the total moment of momentum consisted only of the spin
($\mathfrak{L} = 0$), its magnetic effect would be twice as great as if it
consisted of $\mathfrak{L}$ alone. We therefore assume that the magnetic effect
of the spin $\frac{1}{2}\mathfrak{S}$ is twice as great as that of the orbital angular
momentum $\mathfrak{L}$; the perturbation due to an external magnetic field
$\mathfrak{H}$ is therefore to be taken as

    $W = \frac{eh}{2\mu c} \, (\mathfrak{H}, \; \mathfrak{L} + \mathfrak{S})$.    (4.3)

The spin offers an explanation of why the beam in the
Stern-Gerlach experiment is separated into two parts. The valence
electron of the univalent silver atom is, in the normal state,
in an s-orbit (l = 0); hence $j = \frac{1}{2}$ and m can assume only the
values $\pm \frac{1}{2}$. Although the component of the mechanical
moment of momentum in the direction of the magnetic field
can have only the values $\pm \frac{1}{2} h$, the experiment shows that the
value of the magnetic moment of the atom is a whole Bohr
magneton, and not the half of one; but we now see that since
the mechanical moment of momentum consists only of spin
it should give rise to twice the expected magnetic moment.
The connection between magnetic moment and mechanical
moment of momentum is even more apparent in the
magneto-mechanical effect: the demagnetization of a vertically suspended
bar of soft iron must result in giving to it an angular momentum.
The ratio between the change in the magnetic moment and the
moment of momentum was expected to be $\frac{e}{2\mu c}$, but the
experiment, which was performed only on ferro-magnetic bodies,
yielded twice this value. The anomalous magnetic behaviour
of the spin also accounts for this result, if we assume that the
mechanical moment of momentum in ferro-magnetic substances
is due entirely to the electron spin.

Does this hypothesis also explain the general Landé formula
(4.2)? This is answered by the formula (3.12) obtained toward
the end of § 3, in which j, j'; J must be taken as $\frac{1}{2}$, l; j in order
that it apply to the composition of electron spin and electron
translation. We find that in the state (lj) the temporal mean
value of the spin $\frac{1}{2}\mathfrak{S}$ is equal to $\mathfrak{M}$ multiplied by the factor

    $\kappa = \frac{1}{2} + \frac{\frac{3}{4} - l(l + 1)}{2j(j + 1)}$

or

    $g - 1 = \pm \frac{1}{2l + 1} \qquad \text{for } j = l \pm \frac{1}{2}$.    (4.4)

Hence by (4.3), with $\mathfrak{H}$ the external magnetic field,

    $\langle W \rangle = \frac{eh}{2\mu c} \, g \cdot (\mathfrak{H}, \mathfrak{M})$.

So long as the magnetic separation is small compared with the
spin perturbation the Zeeman separation of the term (lj) is
determined primarily by $\langle W \rangle$; (4.4) then leads, in fact, to
equation (4.2), in agreement with the empirical data.

If the atom consists of several, say f, electrons, the situation
then arising can be understood with the aid of the general rule
of composition. If the electrons are in quantum states with
inner quantum numbers $j_r$ and energy levels $E(j_r)$, (r = 1,
2, ..., f), then on neglecting the interaction between the electrons
the total system has a $(2j_1 + 1) \cdots (2j_f + 1)$-fold energy level
$E(j_1) + \cdots + E(j_f)$. If this level coincides with none of the
other levels it is resolved by a small perturbation into terms
with total inner quantum numbers J in a manner corresponding
to that in which the product

    $\mathfrak{D}_{j_1} \times \mathfrak{D}_{j_2} \times \cdots \times \mathfrak{D}_{j_f}$    (4.5)

is reduced into its irreducible constituents $\mathfrak{D}_J$ (Clebsch-Gordan
series). Obviously in order that this (jj) coupling lead to an
adequate description the mutual interactions between the
electrons must be small compared with the spin perturbation.

The situation usually met is, however, the opposite of that
contemplated above: the normal term order corresponds to
the Russell-Saunders or (sl) coupling. Neglecting for the moment
the interaction between the electrons as well as the spin
perturbation, we are led to a $2^f (2l_1 + 1) \cdots (2l_f + 1)$-fold energy
level (2.4) in whose characteristic space the rotation group
induces the representation

    $(\mathfrak{D}_{1/2})^f \times (\mathfrak{D}_{l_1} \times \mathfrak{D}_{l_2} \times \cdots \times \mathfrak{D}_{l_f})$.    (4.6)

Due to the interaction between the electron translations the
second factor is reduced in a manner analogous to (4.5); a
single term with azimuthal quantum number L has now the
multiplicity $2^f (2L + 1)$. We next reduce

    $(\mathfrak{D}_{1/2})^f = \sum \mathfrak{D}_s$,    (4.7)

and finally, as the last step, we carry out the reduction

    $\mathfrak{D}_s \times \mathfrak{D}_L = \sum \mathfrak{D}_J, \qquad (J = L + s, \; L + s - 1, \ldots, |L - s|)$,    (4.8)

associated with the coupling between the spin and the orbital
moment of momentum. The terms which result from this
last reduction form together a multiplet. Each multiplet is
therefore associated with a definite azimuthal quantum number
L and a spin quantum number s; the individual members of
the multiplet are distinguished by the inner quantum number J.
We call $2s + 1$ the multiplicity, although the number of terms
in the multiplet is only actually equal to this when $L \geq s$, as
by (4.8) their number is less if $L < s$. The $2^f$-dimensional
representation $(\mathfrak{D}_{1/2})^f$ is even or odd according as f is even or odd.
The reduction (4.7) into irreducible constituents accordingly
yields only integral values for s when f is even and only
half-integral values when f is odd: The term multiplicities alternate
regularly between even and odd as we run through the atomic table
in the order of increasing atomic number (H even, He odd, Li
even, Be odd, etc.: "alternation law"). For f = 2 we have, for
example,

    $(\mathfrak{D}_{1/2})^2 = \mathfrak{D}_0 + \mathfrak{D}_1$.

It is empirically found that the bivalent alkaline earth metals
have in fact a singlet and a triplet system of terms. But in the
triplet system the S terms, for which L = 0, are simple; only
the P, D, ..., terms have the actual multiplicity 3.
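
The reduction (4.7) and the alternation law can be followed step by step by adding one spin $\frac{1}{2}$ at a time. The sketch below is an illustration under stated assumptions: the names `spin_content` and `multiplicities` are hypothetical, and only the pure reduction of $(\mathfrak{D}_{1/2})^f$ is carried out, with no further restriction on the electrons.

```python
from collections import Counter
from fractions import Fraction

HALF = Fraction(1, 2)


def spin_content(f):
    """Multiplicity of each D_s in the f-fold product of D_{1/2}."""
    content = Counter({Fraction(0): 1})  # no electrons: the identity representation
    for _ in range(f):
        step = Counter()
        for s, count in content.items():
            step[s + HALF] += count      # D_s x D_{1/2} contains D_{s + 1/2} ...
            if s > 0:
                step[s - HALF] += count  # ... and D_{s - 1/2}, provided s > 0
        content = step
    return dict(sorted(content.items()))


def multiplicities(f):
    """Term multiplicities 2s + 1 that occur for f electrons."""
    return [int(2 * s + 1) for s in spin_content(f)]


for f in range(1, 5):
    print(f, multiplicities(f))

# Dimension check for f = 4: the constituents fill the 2^4-dimensional space.
assert sum(int(2 * s + 1) * n for s, n in spin_content(4).items()) == 2 ** 4
```

The listed multiplicities are all even for odd f and all odd for even f, as the alternation law requires.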

Instead of considering all the electrons at once as in (4.6)
we can build up the atom by successively adding one electron
after another. On adding a next electron, say the f-th, to an
atom or an ion A', a multiplet of A' characterized by
azimuthal quantum number L and spin s breaks up into all those
multiplets contained in the representation
$(\mathfrak{D}_s \times \mathfrak{D}_{1/2}) \times (\mathfrak{D}_L \times \mathfrak{D}_l)$,
where l is the azimuthal quantum number of the electron
added. Since

    $\mathfrak{D}_s \times \mathfrak{D}_{1/2} = \mathfrak{D}_{s - 1/2} + \mathfrak{D}_{s + 1/2}, \qquad \mathfrak{D}_L \times \mathfrak{D}_l = \sum \mathfrak{D}_{L^*}$,

    $L^* = L + l, \; L + l - 1, \ldots, |L - l|$,

this results in multiplets $(s^*, L^*)$, one for each of the pairs

    $s^* = s \pm \frac{1}{2}; \qquad L^* = L + l, \; L + l - 1, \ldots, |L - l|$    (4.9)

("branching rule"). The alternation law is again contained in
the first of the above equations. It is to be noted, however,
that the Pauli exclusion principle for equivalent orbits, which
will be discussed in part C of this chapter, materially restricts
the array of multiplets allowed by this rule.

Again applying (3.12) to the composition of spin and orbital
moment of momentum, we find that the $2J + 1$ components
into which a J term of a multiplet (s, L) is split in a weak magnetic
field are displaced from the unperturbed positions by the amounts

    $hog \cdot m \qquad (m = J, \; J - 1, \ldots, -J)$    (4.10)

where the separation factor g is given by

    $g - 1 = \frac{J(J + 1) - L(L + 1) + s(s + 1)}{2J(J + 1)}$.    (4.11)

This is exactly the formula which was derived empirically by
Landé; we here see the importance of the fact that the square
of the absolute value of the moment of momentum $\mathfrak{M}$ (or $\mathfrak{L}$ or $\mathfrak{S}$)
is calculated from the quantum number J (or L or s) by $J(J + 1)$,
etc., instead of $J^2$, etc., as in the older quantum mechanics.

When the magnetic field increases to such an extent that the
magnetic separation becomes comparable with the separation
between the terms of the multiplet we must handle both the
perturbation to which the multiplet separation is due and the
magnetic perturbation together. In order to express the
smallness of the term in the Hamiltonian function to which this
former perturbation is due, we introduce a factor $p$ which will
appear in the same way as the factor $o$ in the magnetic term;
the case of a weak magnetic field may then be expressed by
saying that $o$ is small in comparison with $p$. We can consider
$o$ and $p$ as variables which increase gradually from 0 to their
actual values and follow the dependence of the separation on
their ratio. We therefore write the perturbation term in the
Hamiltonian function in the form

$$W = pW' + oW''.$$

Since the decomposition (4.8) need not for present purposes
be expressed in terms of its ultimate constituents, the individual
electrons, we may here denote the azimuthal and inner quantum
numbers by $l$ and $j$. Let the representation spaces of $\mathfrak{D}_s$, $\mathfrak{D}_l$
be $\mathfrak{r}_s$, $\mathfrak{r}_l$, with co-ordinates $\xi(m_s)$, $x(m_l)$ respectively. Denote
the moments of momentum $\mathfrak{M}$ of these two representations
by $\mathfrak{S}$, $\mathfrak{L}$ respectively; if the magnetic field has as its direction
the $z$-axis, then

$$W'' = h(L_z + 2S_z). \qquad (4.12)$$

The co-ordinate system is again to be so chosen that the rotations
about the $z$-axis appear in reduced form; to such a rotation
of angle $\varphi$ corresponds the transformation

$$\xi(m_s) \to e(-m_s\varphi) \cdot \xi(m_s), \qquad x(m_l) \to e(-m_l\varphi) \cdot x(m_l);$$

the range of the quantum numbers $m_s$ and $m_l$ is given by

$$m_s = s,\ s - 1,\ \ldots,\ -s; \qquad m_l = l,\ l - 1,\ \ldots,\ -l. \qquad (4.13)$$

The variables of $\mathfrak{r}_s \times \mathfrak{r}_l$ then behave like the $(2s + 1)(2l + 1)$
products

$$\xi(m_s) \cdot x(m_l) \qquad (4.14)$$

and are multiplied, under the influence of a rotation $\varphi$ about
the $z$-axis, by $e(-m\varphi)$, where

$$m = m_s + m_l.$$

We now reduce $\mathfrak{D}_s \times \mathfrak{D}_l$ into its irreducible constituents $\mathfrak{D}_j$.
Let the co-ordinates of the $(2j + 1)$-dimensional irreducible
sub-space of $\mathfrak{r}_s \times \mathfrak{r}_l$, in which the representation $\mathfrak{D}_j$ takes place,
be denoted by

$$x(j; m) \qquad (m = j,\ j - 1,\ \ldots,\ -j);$$

$m$ is the magnetic quantum number, i.e. under the influence of
the rotation $\varphi$ about the $z$-axis $x(j; m)$ is multiplied by $e(-m\varphi)$.
The co-ordinate transformation which leads to the complete
reduction of $\mathfrak{D}_s \times \mathfrak{D}_l$ into its constituents is obviously of
such a kind that $x(j; m)$ is a linear combination of those of the
variables (4.14) for which $m_s + m_l$ has the value $m$.

If the unperturbed system possesses no accidental degeneration
the separation is determined by that part of the matrix
(4.12) in which the sub-space $\mathfrak{r}_s \times \mathfrak{r}_l$ of $W$ intersects itself.
We must therefore solve a secular equation $G$ of degree
$(2s + 1)(2l + 1)$; but the problem is materially simplified by
the fact that the perturbation term possesses rotational symmetry
about the $z$-axis, as the only non-vanishing elements of the
matrix $W$ are those for which $m \to m$. The one secular equation
$G$ is consequently broken up into $2(l + s) + 1$ secular equations
$G_m$ corresponding to the possible values

$$m = l + s,\ l + s - 1,\ \ldots,\ -(l + s)$$

of $m$. The degree of $G_m$ is given by the number of possible
partitions of $m$ into two summands $m_s + m_l$ which run through
the ranges (4.13). In the case of a single electron, $s = \tfrac12$, we
have only equations of the first and second degrees, and the
calculation can therefore be carried through completely for this
case.
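The counting argument above is easy to check by enumeration. The sketch below is illustrative only (the name `secular_degrees` is not from the text): for given $l$ and $s$ it lists the degree of each $G_m$ as the number of admissible partitions $m = m_s + m_l$.

```python
from fractions import Fraction

def secular_degrees(l, s):
    """Map each magnetic quantum number m to the degree of G_m, i.e. the
    number of partitions m = m_s + m_l within the ranges (4.13)."""
    l, s = Fraction(l), Fraction(s)
    degrees = {}
    m = l + s
    while m >= -(l + s):
        count = 0
        m_s = s
        while m_s >= -s:
            # m_l = m - m_s must lie between -l and +l
            if abs(m - m_s) <= l:
                count += 1
            m_s -= 1
        degrees[m] = count
        m -= 1
    return degrees

# Single electron, s = 1/2, in a p state (l = 1)
for m, degree in secular_degrees(1, Fraction(1, 2)).items():
    print("m =", m, " degree of G_m =", degree)
```

The degrees add up to $(2s + 1)(2l + 1)$, the degree of the undivided equation $G$, and for $s = \tfrac12$ none of them exceeds 2.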

The roots of the secular equation $G_m$ are the displacements
of the energy terms due to the perturbation. Since the trace
of a matrix is an invariant, the sum of the term displacements
which are associated with a definite value $m$ of the magnetic
quantum number (the roots of the secular equation $G_m$) is equal
to the sum of the terms in the principal diagonal of this portion
of $W$, i.e. to

$$\sum W(m_s m_l;\ m_s m_l) \qquad (m_s + m_l = m).$$

It is therefore a homogeneous linear function of $p$ and $o$ (“sum
rule”). We obtain the part due to the magnetic field by putting
$p = 0$; by (4.12) this is

$$oW''(m_s m_l;\ m_s m_l) = h\,o\,(m_l + 2m_s).$$
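For a single electron the secular equations can be solved in closed form. The sketch below does this under an explicit assumption that goes beyond the text at this point: the multiplet perturbation $W'$ is modelled by the scalar product $(\mathfrak{L}\mathfrak{S})$, in units with $h = 1$. The function name `doublet_levels` is hypothetical. With this model the roots of $G_m$ can be followed from weak to strong fields, and the sum rule can be read off from the trace.

```python
import math

def doublet_levels(l, m, p, o):
    """Roots of G_m for s = 1/2, assuming the model perturbation
    W = p*(L.S) + o*(L_z + 2 S_z) with h = 1 (an illustrative choice)."""
    top = l + 0.5
    if abs(m) == top:
        # only one partition of m: equation of the first degree
        m_s = math.copysign(0.5, m)
        m_l = m - m_s
        return [p * m_l * m_s + o * (m_l + 2 * m_s)]
    # basis states (m_l, m_s) = (m - 1/2, +1/2) and (m + 1/2, -1/2)
    a = 0.5 * p * (m - 0.5) + o * (m + 0.5)
    d = -0.5 * p * (m + 0.5) + o * (m - 0.5)
    # off-diagonal element of L.S between the two basis states
    b = 0.5 * p * math.sqrt(top ** 2 - m ** 2)
    mean = 0.5 * (a + d)
    half_gap = math.hypot(0.5 * (a - d), b)
    return [mean + half_gap, mean - half_gap]

# Follow the two m = 1/2 levels of a p electron as o/p grows
for o in (0.0, 0.5, 2.0, 10.0):
    print(o, doublet_levels(1, 0.5, 1.0, o))
```

In this model the sum of the two roots is $-p/2 + 2om$, a homogeneous linear function of $p$ and $o$ as the sum rule requires, and the two roots never coincide for $p \neq 0$.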


On the other hand, the formulae (4.10), (4.11) determine the
term displacements in the case in which $o$ is small in comparison
with $p$. In consequence of the sum rule these two results must
agree. $l$ and $s$ being fixed once and for all, we denote the Landé
$g$-factor (4.11) by $g(j)$, and we then have

$$\sum (m_l + 2m_s) = m \cdot \sum_j g(j).$$

The sum on the left is extended over all partitions $m = m_l + m_s$
for given $m$, and that on the right over all values of $j$ which
are consistent with the conditions

$$j = |m|,\ |m| + 1,\ \ldots; \qquad j = l + s,\ l + s - 1,\ \ldots,\ |l - s|.$$

$g(j)$ can in fact be determined from this equation. For $m = l + s$
both sums reduce to a single term; we then have

$$l + 2s = (l + s) \cdot g(l + s).$$

For $m = l + s - 1$ there are two possibilities for $(m_l, m_s)$ and
two for $j$: $m_l = l$, $m_s = s - 1$ or $m_l = l - 1$, $m_s = s$; $j = l + s$
or $l + s - 1$. Consequently we must have

$$2l + 4s - 3 = (l + s - 1)\{g(l + s) + g(l + s - 1)\}.$$

In this way we obtain recursion formulae for the successive
calculation of $g(l + s)$, $g(l + s - 1)$, $\ldots$. The reader can
readily verify that the result of the first few steps agrees with
(4.11).
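The verification left to the reader can be carried out with exact rational arithmetic. The following sketch (function names are illustrative, not from the text) computes $g(j)$ step by step from the sum-rule equation and compares each value with the closed formula (4.11).

```python
from fractions import Fraction

def lande_g(j, l, s):
    """Closed formula (4.11), solved for g."""
    j, l, s = Fraction(j), Fraction(l), Fraction(s)
    return 1 + (j * (j + 1) - l * (l + 1) + s * (s + 1)) / (2 * j * (j + 1))

def g_from_sum_rule(l, s):
    """g(j) for j = l+s, l+s-1, ..., obtained recursively from
    sum(m_l + 2 m_s) = m * sum_j g(j); the value j = 0 is skipped."""
    l, s = Fraction(l), Fraction(s)
    g = {}
    m = l + s
    while m >= abs(l - s) and m > 0:
        lhs = Fraction(0)
        m_s = -s
        while m_s <= s:
            m_l = m - m_s
            if -l <= m_l <= l:
                lhs += m_l + 2 * m_s
            m_s += 1
        # all g(j) with j > m are already known from earlier steps
        g[m] = lhs / m - sum(g.values())
        m -= 1
    return g

for j, value in g_from_sum_rule(2, 1).items():
    print("j =", j, " g =", value, " formula:", lande_g(j, 2, 1))
```

For $l = 1$, $s = \tfrac12$ the two steps give $g = 4/3$ and $g = 2/3$, the familiar values for the two levels of a $P$ doublet.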

It is to be noted that in following the terms from a weak
to a strong magnetic field they cannot cross each other,
considered as functions of the monotonic increasing parameter
$o : p$; the “singular elements” of a unitary group, i.e. those
elements for which two or more characteristic values coincide,
constitute a manifold of three, and not simply one, fewer
dimensions.


B. The Lorentz Group 

§ 5. Relativistically Invariant Equations of Motion of an Electron

We have as yet obtained no specific expression for the spin
perturbation; that for the magnetic effect due to an external
field was set up with the aid of the experimental facts. It is
clear that we can arrive at a satisfactory theory of the electron
only when we are able to express its fundamental laws of motion
in a form which is invariant under Lorentz transformations, as
required by the restricted theory of relativity. The solution of
this problem is due to Dirac. We saw in III, § 8, how the
2-dimensional representation of the rotation group, which,
following Pauli, characterizes the covariant quantity $\psi = (\psi_1, \psi_2)$
describing the wave field, can be extended to the group of
positive Lorentz transformations; $\psi_1, \psi_2$ play the same role as
the variables introduced in that connection.

Following de Broglie we took as the wave equation of a
particle of mass $m$ in field-free space

$$\left(-\frac{1}{c^2}\frac{\partial^2}{\partial t^2} + \sum_{r=1}^{3}\frac{\partial^2}{\partial x_r^2}\right)\psi = \left(\frac{cm}{h}\right)^2\psi. \qquad (5.1)$$

But this equation is not in agreement with the general scheme
of quantum mechanics, which requires that only first order
derivatives with respect to the time appear. The formulation
of a relativistically invariant differential equation satisfying
this requirement is, as Dirac discovered, made possible by the
transition from the scalar wave function $\psi$ to one with two
components. We seek to derive these dynamical equations
from a Hamiltonian principle.

Let

$$x_0 = ct, \quad x_1 = x, \quad x_2 = y, \quad x_3 = z$$

constitute a normal co-ordinate system in our 4-dimensional
space-time. If the quantity $\omega$ is of the same kind as $\psi$, the
quantities $\tilde\omega S_\alpha \psi$ behave, in accordance with III, (8.16), like the
four components of a 4-vector; the $S_\alpha$ are the matrices defined
in III, (8.15). Hence in particular

$$\tilde\psi S_\alpha\, d\psi$$

are the components $ds_\alpha$ of an infinitesimal vector; we are here
dealing with a linear correspondence which is independent of
the co-ordinate system employed and which sends the vector
$dx$ over into $ds$. Its trace

$$\sum_\alpha \tilde\psi S^\alpha \frac{\partial\psi}{\partial x_\alpha} \qquad (5.2)$$

is consequently a scalar and its integral (multiplied by $1/i$)

$$M = \frac{1}{i}\int \sum_\alpha \tilde\psi S^\alpha \frac{\partial\psi}{\partial x_\alpha} \cdot dx \qquad (dx = dx_0\, dx_1\, dx_2\, dx_3), \qquad (5.3)$$

extended over any finite portion of the world, is a quantity which
is independent of the co-ordinate system.*

* The letter $M$ used for the material part of the action is not to be confused
with the moment of momentum.

Although $M$ may not be real, it is practically real in the sense
that $M - \bar M$ is the integral of a complete divergence. For
since the $S_\alpha$ are Hermitian matrices,

$$\bar M = -\frac{1}{i}\int \sum_\alpha \frac{\partial\tilde\psi}{\partial x_\alpha} S^\alpha \psi \cdot dx$$

and $M - \bar M$ is in fact the integral of

$$\frac{1}{i}\sum_\alpha \frac{\partial(\tilde\psi S^\alpha \psi)}{\partial x_\alpha}.$$

In using $M$ as an action we are not interested in $M$ itself, but
only in its variations $\delta M$ caused by arbitrary infinitesimal
variations of $\psi = (\psi_1, \psi_2)$ which vanish outside of a given
finite portion of the world (the integral is then extended over
the entire world or, what amounts to the same, over this finite
portion). The circumstances mentioned above guarantee that
$\delta M$ is real; on writing it in the form

$$\delta M = \int (\delta\tilde\psi \cdot \omega + \tilde\omega \cdot \delta\psi)\, dx$$

we find on comparison with (5.3) that

$$\omega = \frac{1}{i}\sum_\alpha S^\alpha \frac{\partial\psi}{\partial x_\alpha}.$$

We thus arrive at the first order differential operator

$$\nabla = \sum_\alpha S^\alpha \frac{\partial}{\partial x_\alpha}. \qquad (5.4)$$

From the invariance of (5.2) it follows that this operator
transforms $\psi = (\psi_1, \psi_2)$ into a quantity $\psi' = (\psi'_1, \psi'_2)$ which
transforms contragrediently to $\bar\psi = (\bar\psi_1, \bar\psi_2)$ under the influence of
an arbitrary positive Lorentz transformation. If we wish to
guarantee that $M$ is real, we may replace the original definition
by its real part $\tfrac12(M + \bar M)$.

In III, § 8, we found it necessary to introduce quantities
$\psi'_1, \psi'_2$ which transform contragrediently to $\bar\psi_1, \bar\psi_2$ in order to
be able to extend the restricted Lorentz group to the complete
group. And just as $\nabla$ applied to $\psi$ generates a quantity of the
kind $\psi'$, in the same way the “conjugate” operator $\nabla'$
transforms $\psi'$ into a quantity of the kind $\psi$. $\nabla'\nabla$ is, as is readily
verified, the operator

$$\frac{\partial^2}{\partial x_0^2} - \sum_{r=1}^{3}\frac{\partial^2}{\partial x_r^2}.$$

Consequently equation (5.1) for $\psi_1, \psi_2$ can be written in the form

$$\frac{1}{i}\nabla\psi + m_0\psi' = 0, \qquad \frac{1}{i}\nabla'\psi' + m_0\psi = 0 \qquad (5.6)$$

on introducing an auxiliary pair of components $\psi'$. From now
on we denote the column of the four components $\psi_1, \psi_2$; $\psi'_1, \psi'_2$
by $\psi$ and employ $S_\alpha$ as the symbol for the transformations of
these four components as in the latter part of Chapter III;
with this understanding the differential equations (5.6) arise
from an action integral which is composed additively of the
quantity $M$, (5.3), and the invariant [cf. III, (8.19)]

$$M' = m_0 \int \tilde\psi T \psi \cdot dx.$$

$M$ and $M'$ are also invariant with respect to interchange of
right and left, i.e. under the spatial reflection $i$ in the origin.
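The statement that the product of the two first order operators is the second order wave operator can be checked on a plane wave, for which each derivative $\partial/\partial x_\alpha$ is replaced by a number $ik_\alpha$. The sketch below rests on two assumptions that are not spelled out in the text at this point: the $S_r$ are taken to be the Pauli matrices, and $\nabla$, $\nabla'$ are taken to differ only in the sign of the spatial matrices. All names are illustrative.

```python
# Pauli matrices, assumed here to be the S_1, S_2, S_3 of III, (8.15)
S1 = [[0, 1], [1, 0]]
S2 = [[0, -1j], [1j, 0]]
S3 = [[1, 0], [0, -1]]

def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def symbol(k0, k, sign):
    """Matrix k0*1 + sign*(k1*S1 + k2*S2 + k3*S3)."""
    return [[k0 * (i == j)
             + sign * (k[0] * S1[i][j] + k[1] * S2[i][j] + k[2] * S3[i][j])
             for j in range(2)] for i in range(2)]

def product(k0, k):
    """(k0 - S.k)(k0 + S.k); expected to equal (k0^2 - |k|^2) times 1."""
    return matmul(symbol(k0, k, -1), symbol(k0, k, +1))

print(product(2.0, (1.0, 2.0, -1.0)))
```

The result is a multiple of the unit matrix with factor $k_0^2 - k_1^2 - k_2^2 - k_3^2$, which is the plane-wave form of the operator appearing in (5.1); the coupling between the components drops out only in the product, not in either factor.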

In accordance with the general scheme of quantum mechanics
the differential equations for $\psi$ should, as already remarked,
contain only the first derivative of $\psi$ with respect to time; the
additional requirement that they be relativistically invariant then
leads to the conclusion that they can also contain only first
derivatives with respect to the spatial co-ordinates. We have
here been able to satisfy these requirements without altering
the actual content of de Broglie’s equation (for the components
$\psi_1, \psi_2$); the equations thus obtained are to be taken as the
equations for a free particle. This formal transition to first
order equations will become physically significant only when
we pass to the derivation of the equations of motion in an
electro-magnetic field with the aid of the principle of gauge invariance
developed in II, § 12. According to it, if $-\varphi_0$ is the scalar and
$\varphi_1, \varphi_2, \varphi_3$ the vector potential, we must replace

$$\frac{1}{i}\frac{\partial}{\partial x_\alpha} \quad \text{by} \quad \frac{1}{i}\frac{\partial}{\partial x_\alpha} + \frac{e}{hc}\varphi_\alpha. \qquad (5.7)$$

It will be found convenient in the following to introduce the
quantities $f_\alpha$ obtained by multiplying the potentials $\varphi_\alpha$ by the
factor $e/hc$. Then in

$$M = \frac{1}{i}\int \tilde\psi \cdot \nabla\psi \cdot dx \qquad (5.8)$$

the operator $\nabla$ is defined by

$$\nabla = \sum_\alpha S^\alpha\left(\frac{\partial}{\partial x_\alpha} + i f_\alpha\right). \qquad (5.9)$$


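Because of this gauge invariance the quantities $M$, $M'$ are
unchanged on replacing simultaneously

$$\psi \ \text{by} \ e^{i\lambda}\cdot\psi \quad \text{and} \quad f_\alpha \ \text{by} \ f_\alpha - \frac{\partial\lambda}{\partial x_\alpha}, \qquad (5.10)$$

where $\lambda$ is an arbitrary function of position in space-time. Now
take $\lambda$ to be an infinitesimal function which vanishes outside
a certain finite portion of the world: then $\delta M$ and $\delta M'$ must
automatically vanish for the variations

$$\delta\psi = i\lambda\cdot\psi, \qquad \delta f_\alpha = -\frac{\partial\lambda}{\partial x_\alpha}. \qquad (5.11)$$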
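The invariance asserted here can be made explicit in one line. The following worked step is a sketch under the assumption that the operator acting on $\psi$ has the form $\partial/\partial x_\alpha + i f_\alpha$; it shows that the combined replacement of $\psi$ and $f_\alpha$ only multiplies the result by the phase factor $e^{i\lambda}$, which then cancels against the conjugate factor coming from $\tilde\psi$.

```latex
\begin{aligned}
\left(\frac{\partial}{\partial x_\alpha} + i f_\alpha - i\frac{\partial\lambda}{\partial x_\alpha}\right)
\left(e^{i\lambda}\psi\right)
&= e^{i\lambda}\left(\frac{\partial\psi}{\partial x_\alpha}
 + i\frac{\partial\lambda}{\partial x_\alpha}\psi
 + i f_\alpha\psi
 - i\frac{\partial\lambda}{\partial x_\alpha}\psi\right) \\
&= e^{i\lambda}\left(\frac{\partial}{\partial x_\alpha} + i f_\alpha\right)\psi .
\end{aligned}
```

Since $\tilde\psi$ acquires the factor $e^{-i\lambda}$, every expression of the form $\tilde\psi\,(\ldots)\,\psi$ built with this operator is unchanged, and the same holds trivially for $\tilde\psi T \psi$.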

The complete expression

$$\delta(M + M') = \int \Bigl[(\delta\tilde\psi \cdot \omega + \tilde\omega \cdot \delta\psi) + \sum_\alpha s^\alpha\, \delta f_\alpha\Bigr] dx$$

for the variation automatically tells us that under the assumption
that the laws of matter (5.6) are satisfied, i.e. that $\omega = 0$,

$$\delta(M + M') = \int \sum_\alpha s^\alpha\, \delta f_\alpha \cdot dx.$$

Hence we have as a consequence of the laws of matter

$$\int \sum_\alpha s^\alpha \frac{\partial\lambda}{\partial x_\alpha}\, dx = 0,$$

i.e. the continuity equation

$$\sum_\alpha \frac{\partial s^\alpha}{\partial x_\alpha} = 0. \qquad (5.12)$$

A glance at the explicit expression for $M$ shows that

$$s^\alpha = \tilde\psi S^\alpha \psi; \qquad (5.13)$$

these are the quantities which formed the starting-point for
the theory of the transformations of $\psi$ as developed in III, § 8,
and we already know that they form the components of a
4-vector which is independent of the particular space-time
co-ordinates employed. The time component

$$s^0 = (\bar\psi_1\psi_1 + \bar\psi_2\psi_2) + (\bar\psi'_1\psi'_1 + \bar\psi'_2\psi'_2) \qquad (5.14)$$

is the probability density and hence $(s^1, s^2, s^3)$ is what
may be called the probability current: in order to obtain the
number of particles which will on the average pass through
a surface element $do$ in time unit, multiply the total number of
particles present into the product of the area $do$ and the normal
component of the vector $s$. On integrating the equation (5.12)
over a volume $V$ we find that the increase in the mean number
of particles in $V$ per unit time is equal to the mean number of
particles entering $V$ through the surface in unit time. In
contrast to the provisional scalar theory, the Dirac theory leads in
a most natural way to expressions for the probability density, as
well as the probability current, which depend on $\psi$ alone.

On integrating

$$s^0\, dx_1\, dx_2\, dx_3$$

over the whole of space we find that the integral is independent
of time and, in accordance with the statistical interpretation
of $\psi$, is to be so normalized that its value is 1. Consequently,
in the dynamical law

$$\frac{1}{i}\frac{d\psi}{dt} + H\psi = 0$$

the energy $H/h$ is a Hermitian operator, as should be. We
shall from now on take $h$ as the unit of action, with corresponding
units for linear and angular momentum. The result of this is
that the quantity $h$ disappears completely from the laws of
quantum mechanics. With the usual abbreviation,
$p_r = \dfrac{1}{i}\dfrac{\partial}{\partial x_r}$,

$$\frac{1}{c}H = f_0 + \sum_{r=1}^{3} S_r(p_r + f_r) + m_0 T. \qquad (5.15)$$

The influence of the electro-magnetic field on the matter is
taken care of by (5.9), but, on the other hand, the matter
generates the electro-magnetic field in accordance with Maxwell’s
equations. In order to express this explicitly we must add to
$M + M'$ the Maxwellian action

$$F = \tfrac12 \int \{(f_{23}^2 + f_{31}^2 + f_{12}^2) - (f_{10}^2 + f_{20}^2 + f_{30}^2)\}\, dx \qquad (5.16)$$

of the electro-magnetic field, where the $f_{\alpha\beta}$, the antisymmetric
derivatives of the potentials $f_\alpha$,
are the field strengths, which are unaffected by the change of
gauge (5.10). F is obtained from 

(5.17) 


by multiplication with c 



2 


(5.17) is the action in 


Heaviside units, which are best adapted to the electro-magnetic 
field theory. Since we have taken h as the unit of action, the 
total action of our system, consisting of matter plus field, is 


W = M + M' + 



(5.18) 


For reasons which will be apparent later the real number $\alpha/4\pi$
is called the fine structure constant. Whereas the variation
of the $\psi$ in the Hamiltonian integral $W$ yields the equations
of matter, variation of $f_\alpha$ leads to the equations of the
electro-magnetic field with

$$-e \cdot s^\alpha = -e \cdot \tilde\psi S^\alpha \psi \qquad (5.19)$$

appearing as the 4-vector of charge and current density. The
only constants occurring in the field equations are the two
combinations

$$m_0 = \frac{cm}{h}, \qquad \alpha = \frac{e^2}{ch} \qquad (5.20)$$

of fundamental atomic constants; the first is a reciprocal
length and the second a pure number.

Schrödinger, in his fundamental papers on wave mechanics,
thought he could explain the quantum behaviour of matter
and radiation “classically” by setting up a closed system of
field equations such as we have obtained above. In particular,
he held that the charge of the electron was actually “smeared”
over the whole of space with the density $-e \cdot s^0$. But there can
be no doubt at the present time that the field equations are not
to be interpreted in this classical manner; they must rather
be interpreted in accordance with the statistical view-point
developed in Chapter II. The expression (5.14) for the density
then guarantees the atomistic structure of electricity. To show
this we first remark that the charge in a volume $V$ is represented
by $-e$ times the Hermitian form

$$\int_{(V)} \tilde\psi\psi\, dx_1\, dx_2\, dx_3.$$

But this is an “idempotent” form with respect to the “vector”
$\psi$; its characteristic values are 1, 0 and the corresponding
characteristic functions are those quantities $\psi$ which vanish
outside or inside $V$, respectively. The charge contained in $V$
is accordingly capable of assuming only the values $-e$ and 0,
i.e. according to whether the electron is found in $V$ or not. In
order to guarantee the atomicity of electricity the electric
charge density must equal $-e$ times the probability density.
But if we base our theory on the de Broglie wave equation,
modified by introducing the electro-magnetic potentials in
accordance with the rule (5.7), we find as the expression for the
charge density one involving the temporal derivative $\partial\psi/\partial t$ in
addition to $\psi$; this expression has nothing to do with the
probability density and is not even an idempotent form. According
to Dirac this is the most conclusive argument for the stand
that the differential equations for the motion of an electron in
an electro-magnetic field must contain only first order derivatives
with respect to the time. Since it is not possible to obtain
such an equation with a scalar wave function which satisfies at
the same time the requirement of relativistic invariance, the
spin appears as a phenomenon necessitated by the theory of
relativity.

The theorem of the conservation of electricity (5.12) follows,
as we have seen, from the equations of matter, but it is at the
same time a consequence of the electro-magnetic equations.
The fact that (5.12) is a consequence of both sets of field laws
means that these sets are not independent, i.e. that there exists
an identity between them. The true ground for this identity
is to be found in the gauge invariance, for it is equivalent to
the assertion that $\delta W$ vanishes identically when $\psi$ and $f_\alpha$ are
subjected to variations of the form (5.11). We have

$$\delta W = \int \Bigl[(\delta\tilde\psi \cdot \omega + \tilde\omega \cdot \delta\psi) + \sum_\alpha L^\alpha\, \delta f_\alpha\Bigr] dx,$$

where $\omega = 0$ are the equations of matter and $L^\alpha = 0$ the
Maxwellian equations. On substituting the variations from
(5.11) and integrating the last term in the integral by parts,

$$\frac{1}{i}(\tilde\psi\omega - \tilde\omega\psi) + \sum_\alpha \frac{\partial L^\alpha}{\partial x_\alpha} = 0.$$

Because of the arbitrariness of the gauge the number of
independent equations must be one less than the number of unknown
functions $\psi$ and $f_\alpha$.


§ 6. Energy and Momentum. Remarks on the Interchange of Past and Future


I. Energy and Momentum. 

The complete field equations are explicitly 

div ^ p — 0, ^ — curl § — 

OXq 


(6.1) 


Where @ and § arc the electric and magnetic field strengths : 
p ^ / ^/o .... u . . . • 9\ 

oX'dXi 'bxj' ^ ' (x\'dX2 'bxj' ’ 

p is the charge density tffifsy and the components • • • of the 
current § are given by 

= ^5i^, • • •. (6.3) 

In addition to the differential law

$$\frac{\partial\rho}{\partial x_0} + \operatorname{div} \mathfrak{s} = 0, \qquad (6.4)$$

expressing the conservation of electricity, we have a vector
conservation law governing energy and momentum. A completely
satisfactory expression for the tensor representing density and
flux of energy and momentum is only to be obtained along the
lines employed in the general theory of relativity. Here we
give only the result for the density of energy $-c \cdot t_0^0$ and
momentum $(t_1^0, t_2^0, t_3^0)$, and in doing so we separate the material
from the electro-magnetic part. We have for the part referring
to matter




i^Xnr 


-f mo^Ttf ) ; 


— 1 4- 1 

2iv 'bXi / 4 ixj’ 


( 6 . 6 ) 


We have here introduced, in addition to $S_p$, the operator $S'_p$
($p = 1, 2, 3$) which acts on all four components of $\psi$; whereas
the former subjects $\psi_1, \psi_2$ to the 2-dimensional transformation
$S_p$ [III, (8.15)] and $\psi'_1, \psi'_2$ to $-S_p$, the latter exercises the
same 2-dimensional transformation $S_p$ on both pairs of
components. Correspondingly

$$s'_p = \tilde\psi S'_p \psi.$$

The density of energy and momentum due to the electro-magnetic 
field is given by the familiar Maxwellian expressions 

= a(£,//, - E.HJ, • • •. , 

We find the conservation laws 


3 


-0; 




(6.7) 


as consequences of the field equations. Furthermore, the tensor
$t$ is symmetric, not identically, but in consequence of the field
equations; in this sense we have

$$t_p^0 + t_0^p = 0 \ (p = 1, 2, 3); \qquad t_p^q - t_q^p = 0 \ (p, q = 1, 2, 3). \qquad (6.8)$$


On combining these with (6.7) we obtain the divergence con- 
ditions 


y 2 ^3 ^3 ^ 2 ) 


-- 0 , 


V ^(^0 -f- ^1 ^0) _ Q 

,=o ^ 


(6.9) 

( 6 . 10 ) 


These results can all be verified directly, but their deeper
significance can be understood only by going over to the general
theory of relativity as mentioned above. Just as the theorem
of the conservation of electricity follows from the gauge
invariance of the equations, the theorems for the conservation
of energy and momentum follow from the circumstance that
the action integral, formulated as in the general theory of
relativity, is invariant under arbitrary (infinitesimal)
transformations of co-ordinates. In this general relativistic formulation we
need further to erect a normal set of co-ordinate axes at each
point $P$ of space-time, consisting of four mutually perpendicular
directions at $P$ (“orthogonal ennuple”), in order to fix the
metric at $P$ and to be able to describe the wave quantity $\psi$ in
terms of its components; all permissible orthogonal ennuples
at $P$ are obtainable from each other by local Lorentz
transformations which leave $P$ invariant. But the rotations of these local
ennuples can be performed in the various points $P$ quite
independently; the quantities at various points are not bound to
each other as in the special theory of relativity. The symmetry
of the energy-momentum tensor can be traced back to the
invariance with respect to such rotations. One can in fact
take it as a general rule that every invariance property of the
kind met in general relativity, involving an arbitrary function,
gives rise to a differential conservation theorem. In particular,
gauge invariance is only to be understood from this standpoint.
It follows from the transformation laws for $\psi$ that its four
components $\psi_\rho$ relative to the local ennuple are determined only to
within a common factor $e^{i\lambda}$ of proportionality, the exponent $\lambda$
of which depends arbitrarily on position in space-time; in
consequence of this it is necessary, in order to obtain a unique
covariant differential for $\psi$, to set up a linear form $\sum_\alpha f_\alpha\, dx_\alpha$, which
is coupled with the gauge factor contained in $\psi$ in the manner
required by the principle of gauge invariance.

We obtain the integral conservation laws from the differential
ones by integration. We set up the integral

$$J_\alpha = \int t_\alpha^0\, dV \qquad (dV = dx_1\, dx_2\, dx_3)$$

over a section $x_0 = \text{const.}$ of space-time and find that it is
independent of $x_0$. $-cJ_0 = H$ is the energy and $(J_1, J_2, J_3)$
the linear momentum. The material part is, on a simple
integration by parts,




These are Hermitian forms in the “vector” $\psi$. They again lead
us to associate the operators
$\dfrac{1}{i}\dfrac{\partial}{\partial x_1}, \dfrac{1}{i}\dfrac{\partial}{\partial x_2}, \dfrac{1}{i}\dfrac{\partial}{\partial x_3}$
with the components
$(J_1, J_2, J_3)$ of linear momentum, i.e. to the assumptions with
which we, following de Broglie and Schrödinger, began. For the
energy we obtain (on dividing by $c$) the operator

without the additive term $f_0$ as in (5.15); the differential
equations of matter are therefore

Moreover, we must not forget that to the part due to matter
we must yet add that due to the electro-magnetic field.

The quantities

which are by (6.9) also constant, are the components of the
moment of momentum. We find from (6.5) that the part due
to matter is



In agreement with our earlier assumptions we here obtain the
operator which is composed of the sum of the $x_1$-component
$\dfrac{1}{i}\left(x_2\dfrac{\partial}{\partial x_3} - x_3\dfrac{\partial}{\partial x_2}\right)$
of the orbital moment of momentum and the
spin moment of momentum $\tfrac12 S'_1$. The vector

$$\tfrac12\mathfrak{S}' = \tfrac12(S'_1, S'_2, S'_3)$$

is actually the spin, for in accordance with the law of
transformation of $\psi$ both pairs $(\psi_1, \psi_2)$, $(\psi'_1, \psi'_2)$ of components suffer
the same transformation $\sigma$ as in the Pauli theory of the spin
under the influence of a spatial rotation $s$.

On integrating equations (6.10) over the spatial section
$x_0 = \text{const.}$ we obtain

which we may consider as the law of inertia of energy. The
integral may be written $J_0 \cdot \xi_p$, where $\xi_1, \xi_2, \xi_3$ are
the co-ordinates of the “centre of energy”; the equations are
then


We thus obtain the familiar mechanical law: Momentum is
equal to mass times velocity, where the velocity is to be taken as
that of the centre of energy and the mass as $1/c^2$ times the energy
content of the field. Nevertheless it is advisable not to divide
by $H$ in defining the centre of energy, as the energy density
$-t_0^0$ is here no longer positive-definite, and we cannot be certain
that the energy content $H$ will turn out to be positive.

Our theory is a classical field theory, the quantum features
entering only in the statistical interpretation. With this 
interpretation the field laws are concerned with a single electron. 
At the present stage of our development we can deal only with 
the additional quantities due to the electro-magnetic field by 
assuming a given external field affecting the motion of the 
particle, without the particle reacting on the field ; we must 
then surrender our Maxwellian equations. The true laws 
governing the interaction between electrons and quanta will 
only be obtained, in analogy with II, § 13, on subjecting the 
system of field equations to the process of quantization, just 
as was done by Heisenberg for any system of classical mechanical 
differential equations. 

The fact that we are led back to our original assumptions
concerning the operators representing position and momentum
is due to the particular expressions we have chosen for the
action, from which the field equations were obtained; indeed,
it depends entirely on the part $M$. These original postulates
of quantum theory are accordingly of less interest from the
standpoint of general principles than we at first believed. But,
on the other hand, this connection seems to indicate that $M$
cannot be replaced in its role as representing the action due
to matter. $M$ is also responsible for the fact that the charge
and probability densities agree, which is unconditionally
required as a guarantee of the atomistic structure of electric
charge. These connections with the most fundamental physical
observations thus require that the action be composed additively
of $M$ and further terms which are invariant not only under
change of gauge (5.10) as is $M$, but also on replacing $\psi$ by
$e^{i\lambda} \cdot \psi$ and $f_\alpha$ by $f_\alpha - \dfrac{\partial\mu}{\partial x_\alpha}$,
where $\lambda$ and $\mu$ are two independent arbitrary
functions in space-time. $M'$ and the Maxwellian action $F$ are
in fact of this kind. Further relativistic invariant scalars
satisfying these conditions are readily found; indeed it is not
difficult to set up the most general action possible with the
quantities at our disposal. But we have yet to be convinced
by physical observation that the three quantities $M$, $M'$, $F$
here employed do not suffice.


II. Electric and Magnetic Spin Perturbations.

In order to be able to compare Dirac’s theory with the facts,
we eliminate $\psi'_1, \psi'_2$ in the same way as we did in the absence
of the electro-magnetic field. We obtain the equation

$$-\nabla'\nabla\psi = m_0^2\,\psi$$

with the new definition (5.9) of $\nabla$ and $\nabla'$. The substitutions
$S_\alpha$ in two variables satisfied the equations

$$S_0S_1 = S_1S_0 = S_1; \qquad S_2S_3 = -S_3S_2 = iS_1;$$

and consequently those denoted by the same letters but operating
on all four variables obey

$$S_0S_1 = S_1S_0 = S_1; \qquad S_2S_3 = -S_3S_2 = iS'_1.$$
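The appearance of the primed operator on the right-hand side can be checked by direct matrix multiplication. The sketch below assumes, as an illustration, that the 2-dimensional $S_p$ are the Pauli matrices, and it builds the four-component operators as described in the preceding section: $S_p$ acts as $+S_p$ on the first pair of components and as $-S_p$ on the second, while $S'_p$ acts identically on both pairs. Function names are hypothetical.

```python
# Pauli matrices, assumed to be the 2-dimensional S_1, S_2, S_3
PAULI = {
    1: [[0, 1], [1, 0]],
    2: [[0, -1j], [1j, 0]],
    3: [[1, 0], [0, -1]],
}

def block_diag(a, b):
    """4x4 block-diagonal matrix built from two 2x2 blocks."""
    zero = [0] * len(a)
    return [row + zero for row in a] + [zero + row for row in b]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def S(p):
    """Four-component S_p: +sigma_p on (psi_1, psi_2), -sigma_p on (psi'_1, psi'_2)."""
    return block_diag(PAULI[p], scale(-1, PAULI[p]))

def S_prime(p):
    """Four-component S'_p: the same sigma_p on both pairs."""
    return block_diag(PAULI[p], PAULI[p])

# S_2 S_3 = -S_3 S_2 = i S'_1 on all four components
print(matmul(S(2), S(3)) == scale(1j, S_prime(1)))
print(matmul(S(3), S(2)) == scale(-1j, S_prime(1)))
```

The two sign changes in the lower block cancel in the product, which is why the product of two unprimed operators is a primed one; this is the algebraic origin of the spin operator $S'_p$ in the terms of $\nabla'\nabla$ that couple the components.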

V'V contains terms of the following four types : 


(1) 



(2) 

— 


(3) 



(4) - 

- 







■. + *)}' 


We collect together terms of types (1) and (2) to form the 
“ regular term ” in which the components of ip are not coupled 
with each other : 


(- 

\7)X^0 


-) + 




[The transition from lower to upper indices, i.e. from
“covariant” to “contravariant” components, is performed in
accordance with the equations $f^0 = -f_0$, $f^p = f_p$ ($p = 1, 2, 3$).]
The irregular term consists of the electric part

and the magnetic part 

These become, on multiplying by the factor h and expressing 
the electric and magnetic field strengths 6 and § in the usual 
units, 

im), 

We have already (II, § 12) calculated the regular term for a
homogeneous magnetic field. On adding
the regular and irregular terms we obtain, on neglecting the
squares of the potentials, a magnetic perturbation proportional
to $(\mathfrak{H},\ \mathfrak{L} + \mathfrak{S}')$.

This contains the fact, which was already derived in § 4 from
spectroscopic data, that to the spin $\tfrac12\mathfrak{S}'$ twice as great a magnetic
moment is to be ascribed as to the same amount of orbital
moment of momentum; we have now obtained a convincing
theoretical foundation for this procedure. The laws governing
the interaction of a general inhomogeneous magnetic field with
orbital and spin momenta bring out still more emphatically
the essential difference between $\mathfrak{L}$ and $\mathfrak{S}'$. The irregular electric
term, calculated for the central-symmetric field originating in
the nucleus, is the spin perturbation.

The description of the electron given earlier, according to
which it was a composite structure composed of two
kinematically independent parts (the electron translation, with an
$\infty$-dimensional system-space, and the electron spin, with a
2-dimensional system-space) is, in view of the Dirac theory,
no longer quite appropriate. But the classification of spectra
given there is none the less valid here, for it depends only on 
the fact that to the group of rotations of physical space corre- 
sponds the representation x © in the total system-space. 

From the field equations (6.1) as they are to be understood
for the present, i.e. as the laws of motion of an electron in an
external electro-magnetic field, dispersion phenomena can be
(approximately) calculated; they tell us how the motion of the
electron in the normal or other quantum states is affected by
the incident light wave. From the perturbed $\psi$ we then
determine the scattered light with the aid of Maxwell’s equations;
to this class of phenomena belong in particular the Compton
and Smekal-Raman effects. Spontaneous emission can be
handled similarly if we take the considerations of II, § 13, as
justifying the following procedure: The polarization and
intensity of light emitted by the quantum jump $n \to n'$ of the
atom is to be calculated by integrating Maxwell’s equations,
where the expressions $\tilde\psi S^\alpha \psi$ for charge and current density
are to be understood as being formed with the
characteristic functions of the atom in the quantum states concerned.

III. Interchange of Past and Future.

The action is so constructed that it is invariant under 
interchange of right and left; the corresponding substitution is 

$$x_0 \to x_0,\ x_p \to -x_p; \qquad f_0 \to f_0,\ f_p \to -f_p \quad (p = 1, 2, 3); \qquad \psi_1 \leftrightarrow \psi_1',\ \psi_2 \leftrightarrow \psi_2'. \tag{6.12}$$


Does a corresponding result hold for the interchange of past 
and future ? The foundations of the theory lead to the hope 
that it will be able to take account of the essential difference 
between the two time directions, so obvious in Nature. But 
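Dirac has remarked that M, M' go over into −M, −M' under 
the influence of the substitution 

$$x_\alpha \to -x_\alpha,\ f_\alpha \to -f_\alpha \quad (\alpha = 0, 1, 2, 3); \qquad \psi_1 \to \psi_1,\ \psi_2 \to \psi_2; \quad \psi_1' \to -\psi_1',\ \psi_2' \to -\psi_2'. \tag{6.13}$$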

Hence when, in dealing with the motion of an electron in an 
external electro-magnetic field, we obtain a solution ψ which 
contains the time in the factor $e^{i\nu t}$, this substitution will 
lead us to a new solution which contains the time in the factor 
$e^{-i\nu t}$; or, more precisely, a solution of the problem obtained 
by changing f into −f. But this can be done by retaining the same 
external field with potentials f and replacing e by −e. We denote 
such a particle, whose mass is the same as that of the electron but 
whose charge is e instead of −e, as a “positive electron”; it 
is not observed in Nature! It follows from what has been said 
above that the energy levels of such a particle are −hν, where 
hν are those of the negative electron. Disregarding this difference 
in sign, the two particles behave the same. The electron 
will possess, in addition to its positive energy levels, negative ones 
as well, the latter arising from the positive energy levels of the 
positive electron on changing signs as above. Obviously something 
is wrong here; we should be able to get rid of these negative 
energy levels of the electron. But that seems impossible, for 
under the influence of the radiation field transitions should occur 
between the positive and negative terms. That we have twice 
as many terms as we should is obviously related to the fact 
that our quantity ψ has four instead of two components (satisfying 
first order differential equations). The solution of this difficulty 
would seem to lie in the direction of interpreting our 
four differential equations as including the proton in addition 
to the electron. 

The substitution (6.13) transforms the terms M, M' of the 
action into — M, — M' , but leaves the Maxwellian term F 
unaltered. Our field equations as a whole, i.e. when we also 
take into account the reaction of the particle on the radiation 
field, are consequently not invariant under this substitution. 
However, there does exist a substitution which reverses the 
direction of time and which at the same time leaves all terms in 
the action invariant. We mentioned in III, § 8 that the expression 
(5.13) formed from a ψ with two components takes on 
the sign $\delta_\alpha$ ($\alpha = 0, 1, 2, 3$) on going over from 
$\psi_1, \psi_2$ to $\bar\psi_2, -\bar\psi_1$. Hence if ω is a quantity which 
transforms in the same way as ψ then 

$$\tilde\psi S_\alpha \omega \to \delta_\alpha \cdot \tilde\omega S_\alpha \psi;$$

on applying this to $\omega = \partial\psi/\partial x_p$ we find that 


Hence if we make in addition the substitution 


then 


a:o 


o> 


Xp {p — 1,2, 3) 


V Tc V c / 


and consequently M, formula (5.5), remains invariant. In the 
presence of an electro-magnetic field its components must 
change signs in accordance with 

$$f_0 \to f_0, \qquad f_p \to -f_p \quad (p = 1, 2, 3).$$

We have thus found that M, M' and F all remain invariant 
under the substitution 


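$$x_0 \to -x_0,\ x_p \to x_p; \qquad f_0 \to f_0,\ f_p \to -f_p \quad (p = 1, 2, 3); \qquad \psi_1 \to \bar\psi_2,\ \psi_2 \to -\bar\psi_1; \quad \psi_1' \to \bar\psi_2',\ \psi_2' \to -\bar\psi_1'. \tag{6.14}$$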

This shows that the past and the future enter into our field 
theory in precisely the same manner — in spite of the fact that 
the sign in the exponent of the time factor $e^{i\nu t}$ of a solution of 
the quantum problem is unchanged by the substitution (6.14). 
We must of course suspend judgment as to whether the laws 
governing interaction between photons and electrons allow us 
to distinguish between these two directions in time until we 
have carried through the quantization (§ 12). 

§ 7. Electron in Spherically Symmetric Field 

We now proceed to the discussion of the behaviour of an 
electron in a spherically symmetric electrostatic field in Dirac's 
theory. 

I. Dirac's Conservation Theorem. 

From the definitions follow immediately the commutation 
rules: 

$$S_p T = -T S_p, \qquad S'_p T = T S'_p \quad (p = 1, 2, 3).$$

We need further the results 

$$S'_1 S'_1 = 1, \qquad S'_2 S'_3 = -S'_3 S'_2 = i S'_1$$

and the commutation rules 

$$L_1 p_1 - p_1 L_1 = 0, \qquad (\mathfrak{p}\mathfrak{L}) = p_1 L_1 + p_2 L_2 + p_3 L_3 = 0, \qquad L_1 p_2 - p_2 L_1 = i p_3$$

for the components of linear and angular momenta $\mathfrak{p} = (p_1, p_2, p_3)$ 
and $\mathfrak{L} = (L_1, L_2, L_3)$. 

In a spherically symmetric electrostatic field $f_1 = f_2 = f_3 = 0$ 
and $f_0 = \Phi$ is a function only of the distance r from the centre. 
With the aid of the formulae given above it is easily shown that 

$$M_1 = L_1 + \tfrac{1}{2} S'_1$$

commutes with Φ, T, $(\mathfrak{S}\mathfrak{p})$ and consequently with each term in 
the expression 

$$-H = \Phi + (\mathfrak{S}\mathfrak{p}) + m_0 T \tag{7.1}$$

for the energy H. Indeed, this conservation law for the total 
moment of momentum $\mathfrak{M} = \mathfrak{L} + \tfrac{1}{2}\mathfrak{S}'$ was already known to 
us from general considerations. We further find that $(\mathfrak{S}'\mathfrak{L})$ 
commutes with Φ and T, but that 

$$(\mathfrak{S}'\mathfrak{L})(\mathfrak{S}'\mathfrak{p}) + (\mathfrak{S}'\mathfrak{p})(\mathfrak{S}'\mathfrak{L}) = -2(\mathfrak{S}'\mathfrak{p})$$

or 

$$(\mathfrak{S}'\mathfrak{p})\{(\mathfrak{S}'\mathfrak{L}) + 1\} + \{(\mathfrak{S}'\mathfrak{L}) + 1\}(\mathfrak{S}'\mathfrak{p}) = 0.$$

Hence $(\mathfrak{S}'\mathfrak{L}) + 1$ anti-commutes with $(\mathfrak{S}'\mathfrak{p})$ and therefore also 
with $(\mathfrak{S}\mathfrak{p})$; its commutation properties with respect to the three 
terms of (7.1) are therefore the same as those of T. Hence on 
setting 

$$(\mathfrak{S}'\mathfrak{L}) + 1 = kT \tag{7.2}$$

k is a scalar which commutes with the energy H (where by scalar 
we mean invariant under the group of rotations of space). 
Consequently we can decompose the system-space of the electron 
into irreducible sub-spaces $\mathfrak{R}_j$ associated with the rotation 
group, in such a way that the quantity k, which we call the 
auxiliary quantum number, as well as the energy H, 
possesses a definite value in each of the sub-spaces. Now 

$$(\mathfrak{S}'\mathfrak{L})^2 = \{L_1^2 + \cdots + \cdots\} + \{S'_2 S'_3 (L_2 L_3 - L_3 L_2) + \cdots + \cdots\} = \mathfrak{L}^2 - \{S'_1 L_1 + \cdots + \cdots\} = \mathfrak{L}^2 - (\mathfrak{S}'\mathfrak{L})$$

and consequently 

$$\{(\mathfrak{S}'\mathfrak{L}) + 1\}^2 = \mathfrak{L}^2 + (\mathfrak{S}'\mathfrak{L}) + 1 = (\mathfrak{L} + \tfrac{1}{2}\mathfrak{S}')^2 + \tfrac{1}{4}, \qquad \mathfrak{M}^2 = k^2 - \tfrac{1}{4}.$$

This agrees with 

$$\mathfrak{M}^2 = j(j + 1) = (j + \tfrac{1}{2})^2 - \tfrac{1}{4} \tag{7.3}$$

when we put 

$$|k| = j + \tfrac{1}{2}. \tag{7.4}$$

Accordingly, the auxiliary quantum number k is a non-vanishing 
integer. The conservation theorem (7.2) goes beyond (7.3) in 
giving us in addition the sign of k. For a given half-integral j 
the two values $k = \pm(j + \tfrac{1}{2})$ are both possible; they must 
correspond to the two possibilities $l = j \pm \tfrac{1}{2}$ of our previous 
notation. The single quantum number k replaces the two l, j. 
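
The operator identities behind (7.2) to (7.4) can be checked numerically. The following is only a minimal sketch under stated assumptions: the Pauli matrices stand in for the spin matrices S' on a single two-component pair (the interchange T is left out), the orbital matrices are the usual ones for integral l in units of h, and all function names are illustrative rather than taken from the text.

```python
import math

# Pauli matrices: a two-component stand-in for S'_1, S'_2, S'_3 (assumption).
PAULI = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

def matmul(a, b):
    return [[sum(a[i][n] * b[n][j] for n in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def lincomb(*terms):
    """Entrywise sum of coefficient * matrix over (coefficient, matrix) pairs."""
    rows, cols = len(terms[0][1]), len(terms[0][1][0])
    return [[sum(c * m[i][j] for c, m in terms) for j in range(cols)]
            for i in range(rows)]

def kron(a, b):
    """Kronecker product: a acts on the first factor, b on the second."""
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def eye(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def max_abs(a):
    return max(abs(x) for row in a for x in row)

def orbital(l):
    """L_1, L_2, L_3 for integral l in the basis m = l, l-1, ..., -l."""
    n = 2 * l + 1
    ms = [l - i for i in range(n)]
    l3 = [[ms[i] if i == j else 0 for j in range(n)] for i in range(n)]
    raising = [[0.0] * n for _ in range(n)]
    for j in range(1, n):
        m = ms[j]  # raising operator takes |m> (index j) to |m+1> (index j-1)
        raising[j - 1][j] = math.sqrt(l * (l + 1) - m * (m + 1))
    lowering = [[raising[j][i] for j in range(n)] for i in range(n)]
    l1 = lincomb((0.5, raising), (0.5, lowering))
    l2 = lincomb((-0.5j, raising), (0.5j, lowering))
    return [l1, l2, l3]

def conservation_residuals(l):
    """Largest matrix entry of each identity's left side minus right side."""
    n = 2 * l + 1
    one = eye(2 * n)
    big_l = [kron(c, eye(2)) for c in orbital(l)]
    big_s = [kron(eye(n), s) for s in PAULI]
    s_dot_l = lincomb(*[(1, matmul(s, c)) for s, c in zip(big_s, big_l)])
    l_sq = lincomb(*[(1, matmul(c, c)) for c in big_l])
    big_m = [lincomb((1, c), (0.5, s)) for c, s in zip(big_l, big_s)]
    m_sq = lincomb(*[(1, matmul(c, c)) for c in big_m])
    k_op = lincomb((1, s_dot_l), (1, one))
    return {
        # (S'L)^2 = L^2 - (S'L)
        "square": max_abs(lincomb((1, matmul(s_dot_l, s_dot_l)),
                                  (-1, l_sq), (1, s_dot_l))),
        # {(S'L) + 1}^2 = M^2 + 1/4
        "k_squared": max_abs(lincomb((1, matmul(k_op, k_op)),
                                     (-1, m_sq), (-0.25, one))),
        # (S'L) + 1 has only the characteristic values l + 1 and -l
        "spectrum": max_abs(matmul(lincomb((1, k_op), (-(l + 1), one)),
                                   lincomb((1, k_op), (l, one)))),
    }
```

Calling `conservation_residuals(l)` for small l should return three numbers at rounding-error level. The third entry reflects the two possibilities k = l + 1 (for j = l + ½) and k = −l (for j = l − ½), up to the action of T, which this two-component sketch omits.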


II. The Differential Equation for the Determination of the 
Characteristic Values. 

Since the field is spherically symmetric, it suffices to carry 
through the calculation for the point x = 0, y = 0, z = r. At 
this point 

and the Dirac conservation law (7.2) becomes 

"(It + 

together with the equations obtained from these by interchanging 
the two pairs $\psi_1, \psi_2$ and $\psi_1', \psi_2'$ of components. The differential 
equation (6.1) for the characteristic vector ψ, which contains 
the time only in the factor $e^{i\nu t}$, has as its four components the 
two 



and two others of analogous structure ; we have here written 

E = E -0 = U. 
c 

The derivatives with respect to x and y which appear in (7.6) 
can be eliminated with the aid of (7.5) ; the resulting equations 
are 



where 

$$f(r) = \psi_1(0, 0, r), \qquad g(r) = \psi_1'(0, 0, r).$$

The remaining two equations are obtained by writing $\psi_2, \psi_2'$ 
in place of $\psi_1, \psi_1'$. At an arbitrary point P = P(x, y, z) the 
first and third components of ψ satisfy the equations (7.7) in 
a rotated co-ordinate system whose positive z-axis passes through 
P. We shall find it convenient to introduce rf and rg as variables 
in place of f and g, as 

$$\frac{1}{r}\frac{d(rf)}{dr} = \left(\frac{1}{r} + \frac{d}{dr}\right) f.$$

If we wish to avoid the explicit appearance of i in the equations, 
we may write 

$$rf = v + iw, \qquad rg = v - iw$$

and obtain, finally, the fundamental equations 


dw ^ A 

JJy fyi y ^ _ Q 

dr r 

Uw + ^ ~ ^ — 0* 

dr r 


(7.8) 


III. Spherical Harmonics with Spin. 

Let /(r), g[r) be a solution of equations (7.7) ; then in the 
rotated co-ordinate system 

<Ai =/• P) g ‘ Py ^2 ==/. T, ifj^ = g ,r 

where the factors ρ, τ are constants independent of r. On 
returning to the original co-ordinate system each of the pairs 
$\psi_1, \psi_2$; $\psi_1', \psi_2'$ undergoes the transformation σ associated with 
the rotation s. Consequently 


01 

'A 2 


fpi + 
fp2 + 


'1*1 = gPi + M 
'I*! =---- gPi 


(7.9) 


in which f and g depend only on r, and the factors ρ, τ only on 
direction, i.e. on the spherical co-ordinates θ, φ introduced by 
setting 

$$x + iy = r \sin\theta\, e^{i\varphi}, \qquad z = r \cos\theta;$$

the coefficients in (7.9) must further satisfy the conditions 

$$\rho_1(1 - \cos\theta) - \rho_2 \sin\theta\, e^{-i\varphi} = 0, \qquad \tau_1(1 + \cos\theta) + \tau_2 \sin\theta\, e^{-i\varphi} = 0. \tag{7.10}$$

On substituting the expression for $\mathfrak{L}$ in polar co-ordinates 
[II, (4.10)] into the Dirac conservation law, we are led, with 
the aid of (7.9) and (7.10), to the differential equations 

$$\sin\theta\,\frac{\partial\rho_1}{\partial\theta} + i\,\frac{\partial\rho_1}{\partial\varphi} + k(1 + \cos\theta)\,\tau_1 = 0, \qquad \sin\theta\,\frac{\partial\tau_1}{\partial\theta} - i\,\frac{\partial\tau_1}{\partial\varphi} - k(1 - \cos\theta)\,\rho_1 = 0. \tag{7.11}$$

We have thereby accomplished the transformation of the Dirac 
wave equation into polar co-ordinates. (7.9) corresponds to the 
substitution $\psi = f(r)Y_l$ of the scalar theory; in place of the 
single factor f depending only on the distance r we have here the 
pair f, g and in place of the surface harmonic $Y_l$ depending 
only on the direction we have the matrix 

$$\begin{pmatrix} \rho_1 & \tau_1 \\ \rho_2 & \tau_2 \end{pmatrix}.$$





The equations (7.11), together with the conditions (7.10), define 
the “surface harmonics with spin of order k”; they are quite 
independent of the potential Φ. The characteristic values E of 
the equations (7.7) or (7.8) are the energy levels associated with 
quantum number k. 

As in the theory of the ordinary spherical harmonics, we 
here again seek out those spherical harmonics with spin which 
contain the meridian angle only in the multiplicative factor $e^{im\varphi}$: 

$$\rho_1 = (\sin\theta\, e^{-i\varphi})^{-m} \cdot P, \qquad \tau_1 = (\sin\theta\, e^{-i\varphi})^{-m} \cdot Q. \tag{7.12}$$

Substituting these expressions in (7.11) and taking z = cos θ as 
the independent variable, we find 

$$(1 - z)\frac{dP}{dz} = -mP + kQ, \qquad (1 + z)\frac{dQ}{dz} = mQ - kP. \tag{7.13}$$

We denote the solutions P, Q of these equations which lead to 
non-singular functions ρ, τ on the sphere more precisely by 
$P_k^{(m)}, Q_k^{(m)}$. It suffices to consider the case k > 0, for (−P, Q) 
is a solution of the equations obtained by changing k into −k: 

$$P_{-k}^{(m)}(z) = -P_k^{(m)}(z), \qquad Q_{-k}^{(m)}(z) = Q_k^{(m)}(z). \tag{7.14}$$

Furthermore, 

$$\frac{dP_k^{(m)}}{dz} = P_k^{(m-1)}, \qquad \frac{dQ_k^{(m)}}{dz} = Q_k^{(m-1)},$$

for the derivatives of $P_k^{(m)}, Q_k^{(m)}$ satisfy the differential equations 
(7.13) with m − 1 in place of m. For m = −k, P = 1, Q = −1 
is a solution which satisfies all continuity requirements on the 
sphere, since the multiplicative factor 

$$(\sin\theta\, e^{-i\varphi})^{-m} = \left(\frac{x - iy}{r}\right)^{-m}$$

is finite for negative m. Consequently we find polynomial 
solutions of (7.13), the degrees of which are 0, 1, ···, 2k − 1 
corresponding to the values m = −k, −k + 1, ···, k − 1. 
The solution for m = k − 1 is 

$$P(z) = (1 - z)^{k-1}(1 + z)^k, \qquad Q(z) = (1 + z)^{k-1}(1 - z)^k.$$

We thus finally obtain the following explicit expressions for the 
spherical harmonics with spin: 

$$P_k^{(m)}(z) = \frac{d^p}{dz^p}\{(1 - z)^{k-1}(1 + z)^k\}, \qquad Q_k^{(m)}(z) = \frac{d^p}{dz^p}\{(1 + z)^{k-1}(1 - z)^k\}, \tag{7.15}$$

where p = k − 1 − m. They behave very much like the 
ordinary spherical harmonics. The following equations are 
also of importance: 

$$P_k^{(m)}(-z) = (-1)^p\, Q_k^{(m)}(z), \qquad Q_k^{(m)}(-z) = (-1)^p\, P_k^{(m)}(z). \tag{7.16}$$
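
The polynomial statements of this section lend themselves to an exact check with integer coefficients. The sketch below assumes the forms $(1 - z)P' = -mP + kQ$, $(1 + z)Q' = mQ - kP$ for (7.13) and takes P, Q as the p-fold derivatives of $(1 - z)^{k-1}(1 + z)^k$ and $(1 + z)^{k-1}(1 - z)^k$ with p = k − 1 − m, without any normalizing constant. The helper names are hypothetical.

```python
def poly_mul(a, b):
    """Product of two polynomials given as coefficient lists (index = power of z)."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def poly_pow(a, n):
    out = [1]
    for _ in range(n):
        out = poly_mul(out, a)
    return out

def poly_diff(a):
    """Derivative with respect to z; the zero polynomial is returned as [0]."""
    return [i * c for i, c in enumerate(a)][1:] or [0]

def poly_sub(a, b):
    n = max(len(a), len(b))
    a = a + [0] * (n - len(a))
    b = b + [0] * (n - len(b))
    return [x - y for x, y in zip(a, b)]

def is_zero(a):
    return all(c == 0 for c in a)

def reflect(a):
    """Coefficients of a(-z)."""
    return [c * (-1) ** i for i, c in enumerate(a)]

def spin_harmonic_polynomials(k, m):
    """P, Q for k > 0 and -k <= m <= k - 1, obtained by p = k - 1 - m derivatives."""
    p_poly = poly_mul(poly_pow([1, -1], k - 1), poly_pow([1, 1], k))
    q_poly = poly_mul(poly_pow([1, 1], k - 1), poly_pow([1, -1], k))
    for _ in range(k - 1 - m):
        p_poly, q_poly = poly_diff(p_poly), poly_diff(q_poly)
    return p_poly, q_poly

def satisfies_7_13(p_poly, q_poly, k, m):
    """True if (1 - z) P' = -m P + k Q and (1 + z) Q' = m Q - k P hold exactly."""
    lhs1 = poly_mul([1, -1], poly_diff(p_poly))
    rhs1 = poly_sub([k * c for c in q_poly], [m * c for c in p_poly])
    lhs2 = poly_mul([1, 1], poly_diff(q_poly))
    rhs2 = poly_sub([m * c for c in q_poly], [k * c for c in p_poly])
    return is_zero(poly_sub(lhs1, rhs1)) and is_zero(poly_sub(lhs2, rhs2))
```

For k = 1, m = 0 this gives P = 1 + z and Q = 1 − z. Passing `[-c for c in p_poly]` together with `-k` to `satisfies_7_13` tests the sign rule (7.14), and comparing `reflect(p_poly)` with `q_poly` tests the reflection rule (7.16).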

§ 8. Selection Rules. Fine Structure 

I. Selection Rules. 

In a solution ψ defined by (7.9), (7.12) $\psi_1$, like $\rho_1$ and $\tau_1$, 
contains φ only in the factor $e^{im\varphi}$ and $\psi_2$, like $\rho_2$ and $\tau_2$, only in 
the factor $e^{i(m+1)\varphi}$; correspondingly for $\psi_1', \psi_2'$. Hence 

$$M_3\,\psi = (m + \tfrac{1}{2})\,\psi.$$

The z-component of the moment of momentum in the state 
(k, m) is accordingly m + ½. This change in the meaning of 
the quantum number m is to be carefully noted: m + ½ runs 
through the values 

$$k - \tfrac{1}{2},\ k - \tfrac{3}{2},\ \cdots,\ -k + \tfrac{1}{2} \;=\; j,\ j - 1,\ \cdots,\ -j$$

as it should. 

In order to obtain the selection rules for the possible transitions 
(k, m) → (k', m') and to obtain the corresponding intensities 
we must calculate the matrix which represents the energy of 
interaction between the atom and radiation in terms of the 
co-ordinate system determined by the characteristic functions 
$\psi_n$ defining the quantum states n of the atom. Proceeding 
as in II, § 13, we see from (5.15) that this matrix is 

p=i 

The vector $\mathfrak{S}$ here plays the same role as $\mathfrak{q}$ there. The intensities 
are essentially determined by the elements $\mathfrak{S}(nn')$, 
the three components of which are 

S„{nn') - 

The selection rules are merely consequences of the fact that 
$\mathfrak{S}$ is a vector. We first obtain the old result for m and j from 
considerations involving the proper rotations of space. The 
rule for j asserts that the auxiliary quantum number k may go 
over into 

$$\pm(k - 1),\quad \pm k,\quad \pm(k + 1). \tag{8.1}$$

To the reflection i corresponds the interchange T of the two 
pairs $(\psi_1, \psi_2)$, $(\psi_1', \psi_2')$. In polar co-ordinates this reflection 
consists in the transition from (θ, φ) to (π − θ, π + φ); z = cos θ 
is thereby transformed into −z and the factor $e^{im\varphi}$ takes on the 
sign $(-1)^m$. In accordance with (7.15) and the expressions 
for $\rho_1, \tau_1$; $\rho_2, \tau_2$ this results in an interchange of $\rho_1, \tau_1$ with 
possible change of sign, as represented by the substitution 

$$\begin{pmatrix} \rho_1 \\ \tau_1 \end{pmatrix} \to (-1)^{m+p} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} \rho_1 \\ \tau_1 \end{pmatrix} = (-1)^{k-1} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} \rho_1 \\ \tau_1 \end{pmatrix}$$

and the same for $\rho_2, \tau_2$. By (7.9) we therefore have for ψ with 
auxiliary quantum number k: 

$$T\psi(-x, -y, -z) = (-1)^{k-1}\,\psi(x, y, z).$$

The sub-space $\mathfrak{R}_k$ thus has the signature $\delta = (-1)^{k-1}$; this 
result was derived under the assumption k > 0. On replacing 
k by −k and applying (7.14) we find in place of (7.16): 

$$P_{-k}^{(m)}(-z) = (-1)^{p+1}\, Q_{-k}^{(m)}(z), \qquad Q_{-k}^{(m)}(-z) = (-1)^{p+1}\, P_{-k}^{(m)}(z).$$

The signature corresponding to auxiliary quantum number 
−k (k > 0) is accordingly $(-1)^k$. On setting 

$$l = \begin{cases} -k & \text{when } k \text{ is negative } (j = l - \tfrac{1}{2}) \\ k - 1 & \text{when } k \text{ is positive } (j = l + \tfrac{1}{2}) \end{cases} \;=\; |k| - \tfrac{1}{2} - \frac{k}{2|k|} \tag{8.2}$$

both possibilities are included under $\delta = (-1)^l$, or we could 
also write $\delta = \operatorname{sgn} k \cdot (-1)^{k-1}$. The only coefficients occurring 
in a proper vector are those corresponding to transitions in 
which the signature is reversed. Our selection rule (8.1) for 
k is thus narrowed down to 

$$k \to k - 1,\ -k,\ k + 1. \tag{8.3}$$
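
How the signature narrows (8.1) down to (8.3) can be followed by simple enumeration. The sketch below assumes the assignment (8.2) of l to k and the signature $(-1)^l$; the function names are illustrative only.

```python
def orbital_l(k):
    """Azimuthal quantum number l belonging to k according to (8.2)."""
    if k == 0:
        raise ValueError("k is a non-vanishing integer")
    return k - 1 if k > 0 else -k

def twice_j(k):
    """2j, from |k| = j + 1/2."""
    return 2 * abs(k) - 1

def signature(k):
    """Signature (-1)**l of the sub-space with auxiliary quantum number k."""
    return (-1) ** orbital_l(k)

def allowed_targets(k):
    """Targets permitted by (8.1) for which the signature is reversed."""
    candidates = {s * (abs(k) + d) for s in (1, -1) for d in (-1, 0, 1)}
    candidates.discard(0)
    return sorted(t for t in candidates if signature(t) != signature(k))

def k_table(max_l):
    """Rows (l, k for j = l - 1/2 or None, k for j = l + 1/2)."""
    return [(l, -l if l > 0 else None, l + 1) for l in range(max_l + 1)]
```

For instance `allowed_targets(2)` returns `[-2, 1, 3]`, i.e. −k, k − 1 and k + 1, while for k = 1 the value k − 1 = 0 drops out because k never vanishes.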


The following table gives the value of the auxiliary quantum 
number k associated with each possible combination of l and j: 

    l              0    1    2    3    4   ···
    j = l − 1/2         −1   −2   −3   −4  ···
    j = l + 1/2    1    2    3    4    5   ···

II. Transition to the limit c → ∞. 

In order to return from relativistic to ordinary mechanics 
we must pass to the limit c → ∞. Before applying this to 


equations (7.8) we must replace U, i> by wio + I we then 

have, on neglecting ^ in comparison with 


uv = (j* + 

~)v : 


on eliminating w we obtain 

$$-\frac{h}{2m}\left(\frac{d}{dr} + \frac{k}{r}\right)\left(\frac{d}{dr} - \frac{k}{r}\right) v = Uv$$

or 

$$\frac{h}{2m}\left(\frac{d^2}{dr^2} - \frac{k(k - 1)}{r^2}\right) v + Uv = 0.$$

On introducing l by (8.2) we have in both cases k(k − 1) = l(l + 1). 
Hence in the limit terms with the same l, and therefore those 
with auxiliary quantum numbers k and −k + 1, coincide with 
that one associated with azimuthal quantum number l in the 
scalar theory of Chapter II. The doublet found in alkali spectra, 
and in general the multiplet structure of spectral lines, is 
accordingly explained as a relativistic phenomenon. 
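
The step from the product of first-order operators to the centrifugal term can be verified on monomials $v = r^s$. This is a small sketch that assumes the factor ordering $(d/dr + k/r)(d/dr - k/r)$; the function names are not from the text.

```python
def factored_coefficient(k, s):
    """c such that (d/dr + k/r)(d/dr - k/r) r**s = c * r**(s - 2)."""
    inner = s - k        # (d/dr - k/r) r**s       = (s - k) r**(s - 1)
    outer = (s - 1) + k  # (d/dr + k/r) r**(s - 1) = (s - 1 + k) r**(s - 2)
    return inner * outer

def centrifugal_coefficient(k, s):
    """c such that (d^2/dr^2 - k(k - 1)/r^2) r**s = c * r**(s - 2)."""
    return s * (s - 1) - k * (k - 1)

def partners(l):
    """The auxiliary quantum numbers sharing the azimuthal number l >= 1."""
    return l + 1, -l
```

Both coefficients agree for every integer s and k, and for each l ≥ 1 the two values returned by `partners(l)` give the same k(k − 1) = l(l + 1), which is why the corresponding terms merge in the limit c → ∞.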


III. H, He⁺, ···. 

In a Coulomb field with nuclear charge Ze we have 


-0 = 


Za 

4 ^^ 


employing Heaviside units, which are better adapted to a field 
theory. In the following calculations we shall denote the 
multiple Zα/4π of the fine-structure constant simply by α itself, 
and we shall set $m_0 c = \nu_0$. In order to integrate equations 
(7.8) we first perform the substitution 

$$v = e^{-\beta r} \cdot F, \qquad w = e^{-\beta r} \cdot G,$$

where β is a positive constant. Our equations are then 


(8.4) 

Our method will lead to a solution if we choose the constant β 
in such a way that the determinant of the linear combinations 
of F and G on the right vanishes: 

(c) ” (r )' + (8.5) 

We now seek a power series solution 

where the exponent μ begins with an initial value $\mu_0$ and runs 
through the values $\mu_0$, $\mu_0 + 1$, $\mu_0 + 2$, ···. On substituting 
these in (8.4) we obtain the recursion formulae 

(/X -[- (j — 

^ ( 8 . 6 ) 

a + (/X - k)a^ = Q + 

The initial exponent $\mu = \mu_0$ is determined by the fact that the 
determinant of the coefficients of $a_\mu, b_\mu$ on the left must vanish 
for this value of the index: 

$$\mu^2 - k^2 + \alpha^2 = 0; \qquad \mu_0 = \sqrt{k^2 - \alpha^2}.$$

Because of the manner in which jS was determined in (8.5) there 

V V(\ 

exists a linear relation, with coefficients-' T 8 between the 

cc 

right-hand sides of (8.6) which is satisfied identically in 
Hence for all /x 

(" H 7) [(/^ + k)b, - a a,] + P[a b, + = 0 

or 



~”) it'- + ^) -f- « ^ 


+ - k) - 



The power series will break off with the term with exponent /x 
if on replacing b^_i by a^, the right-hand side of (8.6) is 
made to vanish. The condition for this is that 



(8.8) 


it will be satisfied in virtue of (8.7) if the determinant of the 
coefficients in these two equations vanishes : 






= 0 

or by (8.5) 




0 . 


- 

cfi a’ 


Since the exponent μ with which the series break off must be 
of the form $\mu_0 + n$, where n is a non-negative integer, we obtain the 
fine structure formula 


(8.9) 
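
For orientation, the level scheme can be evaluated numerically. The sketch below does not reproduce the display (8.9) of the text; it assumes the standard Sommerfeld form $\nu/\nu_0 = [1 + \alpha^2/(n + \sqrt{k^2 - \alpha^2})^2]^{-1/2}$, with α already standing for the multiple belonging to nuclear charge Z and with the value 7.29 · 10⁻³ quoted below for hydrogen. The function name is hypothetical.

```python
import math

ALPHA = 7.29e-3  # fine-structure constant as quoted in the text for hydrogen

def fine_structure_level(n, k, alpha=ALPHA):
    """nu / nu_0 in the assumed standard form of the fine structure formula."""
    if k == 0 or n < 0:
        raise ValueError("k must be a non-vanishing integer and n >= 0")
    if n == 0 and k < 0:
        raise ValueError("there are no terms with n = 0 and negative k")
    mu0 = math.sqrt(k * k - alpha * alpha)  # initial exponent of the power series
    return 1.0 / math.sqrt(1.0 + (alpha / (n + mu0)) ** 2)
```

With these assumptions the normal state n = 0, k = 1 lies at $\sqrt{1 - \alpha^2}$, the levels for k and −k coincide exactly, and levels with equal n + |k| differ only in order α⁴, as a comparison of `fine_structure_level(1, 1)` with `fine_structure_level(0, 2)` shows.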


The solution ψ of our differential equations, for the characteristic 
values ν = cE defined by (8.9), is of the form 

$$e^{-\beta r} \cdot r^{\mu_0} \cdot (\text{polynomial of degree } n \text{ in } r)$$

and satisfies the condition that the spatial integral of $|\psi|^2$ converge 
in the neighbourhood of the singular points r = 0, ∞. 
These E consequently constitute the discrete term spectrum of an 
ion with nuclear charge Ze and having but one electron outside 
the nucleus. If we neglect the small constant α in comparison 
with k, E depends only on n + |k|. This fine structure formula 
further tells us that the two terms with auxiliary quantum 
numbers k and −k, or the two terms with the same j and for 
which l = j ± ½, exactly coincide. That this is in fact found 
to be the case has already been mentioned in § 4. Equation 
(8.9) has had a remarkable history. It was first derived on the 
basis of the older quantum theory by Sommerfeld and, at about 
the same time, verified by the experiments of Paschen; it was 
perhaps the greatest triumph of that theory, next to Bohr's 
explanation of the Balmer series and his calculation of the 
Rydberg number from universal atomic constants. The new 
quantum theory at first destroyed this beautiful agreement, 
as in its scalar form it led to (8.9) with the half-integral quantum 
number j in place of the integral |k|. Sommerfeld’s original 
formula was only completely re-established with the advent 
of the Dirac theory here discussed. The quantum number k, 
which was used in the older quantum mechanics in place of l 
and which may assume the value 0, has also re-appeared and 
is now supplied with a sign. But on the other hand, the number 
of components in the fine structure is now greater than in 
Sommerfeld’s theory, as in addition to the transitions k → k − 1, 
k + 1 we may now also have k → −k; this addition is also in 
agreement with experiment. 

Our conclusion that (8.8) was to be satisfied in virtue of 
equation (8.7) for the unknowns a^, b^, assuming that the deter- 
minant of the two equations vanished, fails when both coefficients 
of equation (8.6) are zero : 

V + Vq ^ ^ n — k 

rj3 /A + a ’ 

It follows from this that then fi — or n = 0, and that 

ft k < 0, or k < 0. There actually exist no terms u = 0, 
k — — 1, — 2, • • •. For the coefficients a^,, of the beginning 
term in the corresponding solution, which is at the same time 
the end term, would by (8.6), (8.8) necessarily satisfy the equations 

(fj- T" k)bi^ — ctafi = 0, ceb^ -f- (^ — ■ hi)a^ — 6, ^ b^ — - — a^^ = 0 

or 

\ k OL ' 

and this is impossible because of the condition |v| < 

In accordance with the foregoing we may describe the normal 
state of the hydrogen atom, n = 0, k = 1 (l = 0), as follows. 
We take the quantum number m, which may assume either of 
the values 0, −1, to be 0. Let a = 0.532 Å be the radius of 
the first Bohr orbit and α = 7.29 · 10⁻³ the fine-structure constant. 
$\psi_1, \psi_2$; $\psi_1', \psi_2'$ are obtained by multiplying the radial 
function 

A(r) = 

with the factors 

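$$\begin{array}{lll} (1 + \sqrt{1 - \alpha^2}) + i\alpha \cos\theta, & i\alpha \sin\theta\, e^{i\varphi} & \big|\ \psi_1,\ \psi_2, \\ (1 + \sqrt{1 - \alpha^2}) - i\alpha \cos\theta, & -i\alpha \sin\theta\, e^{i\varphi} & \big|\ \psi_1',\ \psi_2'. \end{array}$$

We find from these expressions that the probability density $\tilde\psi\psi$ 
is distributed spherical-symmetrically in accordance with the 
law 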

= [A('')]^ 

The normalization is here not chosen in such a way that the 
integral of p over all space is unity ; it is actually 



1 -l- 2 Vl - a« 


• r(l + 2\/l - g2). 


We have already seen that in a certain sense the probability 
density multiplied by −e represents the distribution of charge 
in the atom. Considering the probability current as determining 
the convection of this continuous charge distribution ρ, 
we find that it represents a circulation about the z-axis with 
velocity α c sin θ (α c is the velocity of the electron in the first 
Bohr orbit on the older theory). On giving the axis of rotation 
all possible directions ψ runs through the 2-parameter family 
of characteristic solutions for which n = 0, k = 1; we may 
take as a basis for this family of solutions the above (m = 0) 
and that for which m = −1, representing a circulation in 
the opposite direction. 

C. The Permutation Group 

§ 9. Resonance between Equivalent Individuals 

The Hermitian forms Q, which represent in system-space all 
possible physical quantities of a given system, constitute a 
totality Σ within which addition and multiplication is defined. 
If Σ were reducible we could choose our co-ordinate system in 
system-space in such a way that all Q would be simultaneously 
completely reduced; these individual parts into which the whole 
would be divisible would then each constitute solutions of the 
quantum problem which were merely accidentally joined together 
to form the given solution. In accordance with the 
fundamental Aristotelian postulate of “nihil frustra” Nature 
could hardly be expected to indulge in such a superfluous luxury. 
Hence we propose the thesis that Σ is an irreducible system. On 
introducing as fundamental quantities the canonical variables 
as in II, § 11, this assumption contains the requirement that it be 
impossible to choose co-ordinates in system-space in such a way 
that the 2f matrices $q_1, \cdots, q_f$; $p_1, \cdots, p_f$ are simultaneously 
completely reduced. This postulate is to be added to the Heisenberg 
commutation rules as an essential supplement. 

In accordance with Burnside’s theorem [III, § 10], which 
we carry over without scruple from spaces with a finite number 
of dimensions to those with infinitely many, the irreducibility 
postulate allows us to assert that there can exist no linear 
homogeneous relation tr(AQ) = 0 between the components of 
Q which is satisfied for all Q. Since in the domain of the Q’s 
not only is multiplication possible (as presupposed in Burnside’s 
theorem) but also addition, we arrive at the conclusion that all 
Hermitian matrices in system-space are contained in Σ. It is 
perhaps desirable to express our requirement directly in the 
form: any Hermitian form represents a physical quantity of 
the system. In accordance with II, § 7 there is associated with 
each statistical ensemble a positive definite Hermitian form A 
in such a way that tr(AQ) is the expectation of the quantity 
represented by Q. Burnside’s theorem asserts that the equation 

tr(AQ) = tr(A'Q) 

can be satisfied for all Q only if A = A', or it is impossible to 
distinguish between the two statistical aggregates represented by 
the positive definite Hermitian forms only if A = A'. In particular 
it follows from this that the states represented by two rays in 
system space are physically different if the two rays are distinct; 
this was to be expected, or even required, from the outset. 
These consequences show the naturalness and cogency of the 
irreducibility postulate, from which it can conversely be deduced. 

The states of physical entities I which are fully equivalent, as, 
for example, the electrons in an atom, are to be represented by 
vectors $\mathfrak{x} = (x_i)$ or rays in the same system-space $\mathfrak{R}$. If two 
such individuals unite to form a single physical system I² the 
vectors of the corresponding system-space $\mathfrak{R} \times \mathfrak{R} = \mathfrak{R}^2$ are, 
in accordance with the general rule of ×-multiplication, the 
tensors of order two. But, by III, § 6, $\mathfrak{R}^2$ is reducible into 
two independent sub-spaces $\{\mathfrak{R}^2\}$ and $[\mathfrak{R}^2]$, the space of 
anti-symmetric and the space of symmetric tensors of 2nd order. 
Physical quantities Q of I² have only an objective physical 
significance if they depend symmetrically on the two individuals. 
This requirement is expressed in terms of the elements of the 
Hermitian form 

$$Q = \sum q_{ik,i'k'}\, \bar{x}_{ik}\, x_{i'k'}$$

by the symmetry condition 

$$q_{ki,k'i'} = q_{ik,i'k'}. \tag{9.1}$$

On reducing $(x_{ik})$ into its anti-symmetric and its symmetric 
parts, 

$$x_{ik} = x\{ik\} + x(ik), \tag{9.2}$$

Q is reduced, in virtue of (9.1), into two Hermitian forms in 
x{ik} and x(ik) respectively. For on substituting (9.2) into Q 
we obtain four terms: those in which $\{\mathfrak{R}^2\}$, $[\mathfrak{R}^2]$ intersect 
themselves, and the two in which $\{\mathfrak{R}^2\}$ intersects $[\mathfrak{R}^2]$ or conversely. 
These last two then vanish, for if we interchange the dummy 
indices i with k, i' with k' in 

$$[Q] = \sum q_{ik,i'k'}\, \bar{x}\{ik\}\, x(i'k')$$

and then replace $\bar{x}\{ki\}$, $x(k'i')$ by $-\bar{x}\{ik\}$, $x(i'k')$ 
we find [Q] = −[Q], or [Q] = 0. The totality of Hermitian 
forms Q which represent the quantities of I² depending symmetrically 
on the two individuals is therefore not irreducible; it 
can be reduced in accordance with the decomposition 

$$\mathfrak{R}^2 = \{\mathfrak{R}^2\} + [\mathfrak{R}^2] \tag{9.3}$$

of the space $\mathfrak{R}^2$. 
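
The vanishing of the two mixed terms can be illustrated with random numbers. The following is a minimal sketch under stated assumptions: the coefficients are only required to obey the symmetry condition (9.1) (Hermitian symmetry plays no part in this step and is not imposed), the dimension n = 3 is arbitrary, and all names are illustrative.

```python
import random

def random_symmetric_form(n, rng):
    """Coefficients q[(i, k, i2, k2)] with q[ki, k'i'] = q[ik, i'k'] as in (9.1)."""
    idx = range(n)
    raw = {(i, k, i2, k2): complex(rng.uniform(-1, 1), rng.uniform(-1, 1))
           for i in idx for k in idx for i2 in idx for k2 in idx}
    return {key: 0.5 * (raw[key] + raw[(key[1], key[0], key[3], key[2])])
            for key in raw}

def split_tensor(x):
    """Anti-symmetric and symmetric parts of a tensor of order two, as in (9.2)."""
    n = len(x)
    anti = [[0.5 * (x[i][k] - x[k][i]) for k in range(n)] for i in range(n)]
    symm = [[0.5 * (x[i][k] + x[k][i]) for k in range(n)] for i in range(n)]
    return anti, symm

def form(q, left, right):
    """Sum of q[ik, i'k'] * conj(left[ik]) * right[i'k']."""
    return sum(c * left[i][k].conjugate() * right[i2][k2]
               for (i, k, i2, k2), c in q.items())

def mixing_check(n=3, seed=0):
    """Sizes of the two mixed terms, and of Q(x) minus the two pure terms."""
    rng = random.Random(seed)
    q = random_symmetric_form(n, rng)
    x = [[complex(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(n)]
         for _ in range(n)]
    anti, symm = split_tensor(x)
    gap = form(q, x, x) - form(q, anti, anti) - form(q, symm, symm)
    return abs(form(q, anti, symm)), abs(form(q, symm, anti)), abs(gap)
```

All three numbers returned by `mixing_check()` should be at rounding-error level, which is the numerical counterpart of [Q] = −[Q].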

In particular, every possible interaction between the two 
individuals depends symmetrically on them, even when other 
physical elements, such as a radiation field, are also involved. 
Hence if I² is at any time in a state contained in one of the 
sub-spaces $\{\mathfrak{R}^2\}$ or $[\mathfrak{R}^2]$ it is for all time impossible to get it out 
of this sub-space by any influence whatsoever. Again, we expect 
Nature to make use of but one of these sub-spaces, but the 
irreducibility postulate offers us no clue as to which one she 
has decided on. 

Take as co-ordinates in the system space $\mathfrak{R}$ of the individual 
I the principal axes $e_i$ of the energy associated with the characteristic 
numbers $E_i$. Disregarding the interaction between 
the two individuals for the moment, the system I² has as energy 
levels $E_i + E_k$ with characteristic vectors $e_i \times e_k = e_{ik}$; each 
characteristic number of the type $E_1 + E_2$ appears twice, and 
the corresponding characteristic space is spanned by the vectors 
$e_{12}$ and $e_{21}$. On introducing the interaction as a small perturbation 
the two states $e_{12}$ and $e_{21}$ are in resonance with each 
other. Denoting the components of the total Hamiltonian 
function by H(ik, i'k'), the transformation of the sub-matrix 

$$\begin{pmatrix} H(12, 12) & H(12, 21) \\ H(21, 12) & H(21, 21) \end{pmatrix}$$

to principal axes, as required by perturbation theory, can in 
the present case be performed in a manner which is universally 
valid; we need only to replace the fundamental vectors $e_{12}$, $e_{21}$ by 

$$\frac{1}{\sqrt{2}}(e_{12} - e_{21}), \qquad \frac{1}{\sqrt{2}}(e_{12} + e_{21}). \tag{9.4}$$

Denoting H(12, 12) = H(21, 21) by hν and the numbers 
H(12, 21) = H(21, 12), which must be real in virtue of the 
condition $H(12, 21) = \overline{H(21, 12)}$ of Hermitian symmetry, by 
hα, the resonance equations become 

$$\frac{1}{i}\frac{dx_{12}}{dt} + (\nu x_{12} + \alpha x_{21}) = 0, \qquad \frac{1}{i}\frac{dx_{21}}{dt} + (\alpha x_{12} + \nu x_{21}) = 0,$$

from which it follows that 

$$\frac{1}{i}\frac{d}{dt}(x_{12} - x_{21}) + (\nu - \alpha)(x_{12} - x_{21}) = 0, \qquad \frac{1}{i}\frac{d}{dt}(x_{12} + x_{21}) + (\nu + \alpha)(x_{12} + x_{21}) = 0.$$

Taking as initial conditions $x_{12} = 1$, $x_{21} = 0$ for t = 0, we find 

$$x_{12} - x_{21} = e^{-i(\nu - \alpha)t}, \qquad x_{12} + x_{21} = e^{-i(\nu + \alpha)t}; \tag{9.5}$$

$$|x_{12}|^2 = \cos^2 \alpha t, \qquad |x_{21}|^2 = \sin^2 \alpha t.$$

We see from this how the two states $e_{12}$, $e_{21}$ alternate back and 
forth with a beat period fixed by α, whereas the components (9.5) 
along the axes (9.4) have always the same constant absolute 
magnitudes. 
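
The beating between the two states can be made concrete with a few lines of code. The sketch assumes the resonance equations in the form $(1/i)\,dx_{12}/dt + \nu x_{12} + \alpha x_{21} = 0$ and its counterpart with the indices interchanged, together with the initial conditions stated above; the function names and the step width are choices of this sketch.

```python
import cmath

def resonance_amplitudes(nu, alpha, t):
    """x12(t), x21(t) for the initial conditions x12 = 1, x21 = 0 at t = 0."""
    minus = cmath.exp(-1j * (nu - alpha) * t)  # x12 - x21, see (9.5)
    plus = cmath.exp(-1j * (nu + alpha) * t)   # x12 + x21, see (9.5)
    return 0.5 * (plus + minus), 0.5 * (plus - minus)

def resonance_residuals(nu, alpha, t, step=1e-5):
    """Absolute left sides of the two resonance equations (central differences)."""
    a_hi, b_hi = resonance_amplitudes(nu, alpha, t + step)
    a_lo, b_lo = resonance_amplitudes(nu, alpha, t - step)
    x12, x21 = resonance_amplitudes(nu, alpha, t)
    d12 = (a_hi - a_lo) / (2 * step)
    d21 = (b_hi - b_lo) / (2 * step)
    return (abs(d12 / 1j + nu * x12 + alpha * x21),
            abs(d21 / 1j + alpha * x12 + nu * x21))
```

Evaluating `resonance_amplitudes` at successive times shows the probability passing from the state $e_{12}$ to $e_{21}$ and back, while the absolute values of the sum and difference stay equal to 1 throughout.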

The only characteristic numbers associated with the system 
space $\{\mathfrak{R}^2\}$ are those of the type $E_1 + E_2$, each of which appears 
exactly once, but the sub-space $[\mathfrak{R}^2]$ has simple characteristic 
numbers of the type $2E_1$ in addition to these. Hence if Nature 
decides in favour of $\{\mathfrak{R}^2\}$ both individuals can never be 
simultaneously in the same quantum state with energy $E_1$, assuming 
this energy level for the individual system is non-degenerate. 
That $E_1 + E_2$ occurs only once in $\{\mathfrak{R}^2\}$ and only once in $[\mathfrak{R}^2]$ 
means: the possibility that one of the identical twins Mike 
and Ike is in the quantum state $E_1$ and the other in the quantum 
state $E_2$ does not include two differentiable cases which are 
permuted on permuting Mike and Ike; it is impossible for 
either of these individuals to retain his identity so that one of 
them will always be able to say “I’m Mike” and the other 
“I’m Ike.” Even in principle one cannot demand an alibi 
of an electron! In this way the Leibnizian principle of 
coincidentia indiscernibilium holds in quantum mechanics. 

On passing from 2 to f equivalent individuals I it is not so 
easy to reduce the representation $(\mathfrak{c})^f$ of the complete linear or of 
the unitary group in system-space $\mathfrak{R}$ into its irreducible 
constituents; we shall go into this matter in the last chapter. 
Nevertheless we know from III, § 5, that the anti-symmetric 
and the symmetric tensors of order f with components 

$$x\{k_1 k_2 \cdots k_f\}, \qquad x(k_1 k_2 \cdots k_f),$$

respectively, each yield such an irreducible representation. 

A physical quantity Q of the total system $I^f$ which depends 
symmetrically on all f individuals will be represented by an 
Hermitian operator Q, the coefficients $q(k_1 k_2 \cdots k_f;\ k_1' k_2' \cdots k_f')$ 
of which are unchanged on subjecting $k_1 k_2 \cdots k_f$ and $k_1' k_2' \cdots k_f'$ 
simultaneously to the same permutation. It is evident that such 
an operator always sends an anti-symmetric tensor $x\{k_1 k_2 \cdots k_f\}$ 
into an anti-symmetric tensor x': 

$$x'\{k_1 k_2 \cdots k_f\} = \sum_{k'} q(k_1 k_2 \cdots k_f;\ k_1' k_2' \cdots k_f')\, x\{k_1' k_2' \cdots k_f'\}.$$

Hence the sub-space $\{\mathfrak{R}^f\}$ of anti-symmetric tensors is reduced 
out of the system-space $\mathfrak{R}^f$ of $I^f$, determined in accordance with 
the general rule of ×-multiplication, in such a way that if $I^f$ 
is ever in the system space $\{\mathfrak{R}^f\}$ it remains there forever, regardless 
of what influences may act upon it. The sub-space $[\mathfrak{R}^f]$ 
of all symmetric tensors x(k) of order f can similarly be separated 
out of $\mathfrak{R}^f$. The energy level $E_1 + E_2 + \cdots + E_f$, which is 
f!-fold degenerate in $\mathfrak{R}^f$, appears in $\{\mathfrak{R}^f\}$ as a simple level. Only 
characteristic numbers of this type appear in $\{\mathfrak{R}^f\}$, but the 
characteristic numbers of $[\mathfrak{R}^f]$ are all numbers which can be 
obtained by summation of f distinct or non-distinct energies E. 

If the system space is n-dimensional, $\{\mathfrak{R}^f\}$ is only possible 
if f ≤ n. If E is an n-fold energy level of the individual I then 
the quantum states with energy E constitute an n-dimensional 
sub-space. If it should happen that only $\{\mathfrak{R}^f\}$ is realized 
in Nature, then in view of the foregoing it would be impossible 
to have more than n individuals of the system $I^f$ in the quantum 
state E. 
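
The counting of levels in the anti-symmetric and symmetric sub-spaces is easily enumerated for a finite model. The sketch below assumes n non-degenerate one-particle energies and no interaction; the sample energies in the usage note are arbitrary, and the function names are illustrative.

```python
from itertools import combinations, combinations_with_replacement
from math import comb, factorial

def antisymmetric_levels(energies, f):
    """Energies of f non-interacting individuals in the anti-symmetric sub-space."""
    return [sum(choice) for choice in combinations(energies, f)]

def symmetric_levels(energies, f):
    """Energies of f non-interacting individuals in the symmetric sub-space."""
    return [sum(choice) for choice in combinations_with_replacement(energies, f)]

def product_space_degeneracy(f):
    """Multiplicity of E_1 + ... + E_f with f distinct energies in the full space."""
    return factorial(f)
```

With the energies 1, 2, 4, 8 and f = 2 the anti-symmetric list has `comb(4, 2)` = 6 entries and contains no doubled energy, the symmetric list has `comb(5, 2)` = 10 entries, and `antisymmetric_levels` returns an empty list as soon as f exceeds n.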

The reduction of $\mathfrak{R}^f$ to $\{\mathfrak{R}^f\}$ or $[\mathfrak{R}^f]$ involves relationships 
which frustrate any attempt at description in terms of our 
old intuitive pictures with their orbits and billiard ball electrons. 
But the difficulty enters already with the general composition 
rule, according to which the manifold of possible pure states 
of a system composed of two parts is much greater than the 
manifold of combinations in which each of the partial systems 
is itself in a pure state. 

§ 10. The Pauli Exclusion Principle and the Structure 
of the Periodic Table 

One of the most fundamental facts of Nature, the ordering of
the chemical elements in the periodic table, can be understood
only with the help of these considerations. We go from one
atom to the following, which we denote by A, in two steps:
the first is preparatory and consists in increasing the charge
on the nucleus by 1, and the second and final step consists in
adding an electron to the ion so obtained. To obtain the
normal state of A this additional electron must be bound as
tightly as possible, i.e. the energy of the total system A must be
a minimum. If we disregard the mutual perturbations of the
electrons for a moment, although they may be very considerable,
we might expect to find every electron in an unexcited atom in
the lowest energy level, i.e. with principal quantum number $n = 1$.
But instead we find the following: The 1 electron of H and the
2 electrons of He are in the 1s orbit, i.e. they are in the quantum
state $n = 1$, $l = 0$. But the next 2 electrons, which are added
in going over to Li, Be, are in a 2s orbit, and the additional 6, the
addition of each of which gives rise to one of the elements from
B to Ne, enter the 2p orbit. Then follow Na, Mg, each with a
new electron in the 3s state, the elements from Al to A, the
additional electrons entering the 3p orbit, etc. These facts
are readily seen on writing the wave number of the lowest
S term in the form $-R/n^{*2}$; in H, He, Li the “effective
quantum number” $n^*$ has the values 1.00, 0.74, 1.59. That
$n^*$ sinks on going from H to He is understandable in view of
the “screening” effect of the original electron on the new one.
We should expect that if the next electron also went into the
orbit $n = 1$ the corresponding value of $n^*$ would be something
like 0.59, but we find instead a number which is greater than
this by unity. The same occurs on going from Be to B or from
Mg to Al; the normal states of these atoms are formed by the
valence electrons entering 2p or 3p orbits because the 2s or 3s
orbits are already “occupied,” and if the valence electron is
raised to an s state by excitation, it can only be raised to one
for which $n \ge 3$ or $n \ge 4$.* Obviously the essential features
of the regularities expressed in the periodic table depend on this
mysterious numerus clausus for the various states with principal
quantum numbers $n = 1, 2, \cdots$ and on the fact that in consequence
of this the electrons in the atom are added on in definite
layers or “shells.” Stated more precisely, in an $ns$ orbit
($n = 1, 2, \cdots$) there is room for but 2 electrons, in an $np$ orbit
($n = 2, 3, \cdots$) for but 6; in general the situation is described
by Stoner's rule: there can be at most $2(2l + 1)$ electrons in a
state with quantum numbers $n$, $l$.

* The physical significance of the “true principal quantum number”
$n$ is contained in these considerations: we think of the term in the Hamiltonian
function which represents the energy of interaction between the various
electrons as multiplied by a numerical factor $\lambda$ and let $\lambda$ decrease steadily
from 1 to 0; this virtual adiabatic process sends each electron into a definite
hydrogenic orbit with a principal quantum number $n$, the “true quantum
number” of the electron.

On taking into account the duplicity caused by the spin we
see that this number is exactly the dimensionality of the sub-space
$\mathfrak{R}(nl)$ in the system space of a single electron. Neglecting
the spin perturbation, which is indeed much smaller than the
mutual perturbations of the electrons, the energy level associated
with this sub-space is $2(2l + 1)$-fold degenerate. This
degeneracy can be removed by the introduction of the spin
perturbation and a weak magnetic field; the energy level is
then broken up into $2(2l + 1)$ simple components distinguished
by the quantum numbers

$$j = l \pm \tfrac{1}{2}; \qquad m = j,\ j - 1,\ \cdots,\ -j.$$

Stoner’s rule led Pauli to postulate the exclusion of equivalent
orbits: it is impossible for two electrons in an atom to be simultaneously
in the same quantum state $(n, l, j, m)$. This shows
that $\mathfrak{R}^f$ is obviously not the system space of the physical system
$I^f$ in which $f$ electrons revolve about a fixed nucleus, but that
the reduction to $\{\mathfrak{R}^f\}$ takes place: Nature has decided in favour
of the reduction to the space of anti-symmetric tensors, at least in
the case of electrons. In view of the considerations advanced in
the previous paragraph this principle leads conversely to Stoner’s
rule.
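
The count underlying Stoner's rule can be checked by listing the simple components explicitly. The sketch below is an illustration only and not part of the original text: the helper names are hypothetical, and it assumes the usual labelling in which j takes the values l + 1/2 and (for l > 0) l - 1/2, with m running from j down to -j in unit steps.

```python
from fractions import Fraction


def spin_orbit_states(l: int):
    """List the pairs (j, m) with j = l +/- 1/2 and m = j, j-1, ..., -j."""
    half = Fraction(1, 2)
    js = [l + half] if l == 0 else [l - half, l + half]
    states = []
    for j in js:
        m = j
        while m >= -j:
            states.append((j, m))
            m -= 1
    return states


def stoner_capacity(l: int) -> int:
    """Maximum number of electrons in a state with quantum numbers n, l."""
    return 2 * (2 * l + 1)


if __name__ == "__main__":
    for l, letter in enumerate("spdf"):
        print(letter, len(spin_orbit_states(l)), stoner_capacity(l))
```

Each pair (j, m) can be occupied by at most one electron, so the length of the list is the capacity of the sub-shell.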

If the formation of one atom from the preceding one were 
an entirely regular process the occupation of the various states 
would take place in accordance with the following table, the 
lower row of which indicates the number of electrons captured, 
on going from atom to atom, by the orbit immediately above : 

1s;   2s, 2p;   3s, 3p, 3d;    4s, 4p, 4d, 4f;     ...
2;    2 + 6;    2 + 6 + 10;    2 + 6 + 10 + 14;    ...

This would indeed be the case if we could increase the charge on 
the nuclei by some large fixed amount, for the mutual perturba- 
tions of the electrons could thus be made arbitrarily small in 
comparison with the Coulomb attraction of the nucleus. But 
even a rough calculation shows that these perturbations are
actually too considerable not to lead to displacements in the
above table, i.e. to changes in the order in which the various
shells are filled. For example, after the 3p shell is filled, which
is accomplished with A, the next 2 electrons go into 4s states
to form K, Ca, and only then do we find electrons entering the
3d orbits to form Sc, Ti, .... For details consult the books
by Hund, Pauling and Goudsmit or Ruark and Urey mentioned
in the Introduction.
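
The idealized table above can be generated mechanically from Stoner's rule. The following sketch is illustrative and not part of the original text; the function names and the string of spectroscopic letters are assumptions of the example, and it reproduces only the regular scheme, not the displaced order actually found in Nature.

```python
LETTERS = "spdfghik"  # spectroscopic letters for l = 0, 1, 2, ... (j is skipped)


def regular_shell(n: int):
    """Sub-shells (label, capacity) of the shell with principal quantum number n."""
    return [(f"{n}{LETTERS[l]}", 2 * (2 * l + 1)) for l in range(n)]


def regular_table(n_max: int):
    """Rows (labels, capacities, total) for n = 1..n_max under idealized filling."""
    rows = []
    for n in range(1, n_max + 1):
        shell = regular_shell(n)
        labels = [label for label, _ in shell]
        capacities = [capacity for _, capacity in shell]
        rows.append((labels, capacities, sum(capacities)))
    return rows


if __name__ == "__main__":
    for labels, capacities, total in regular_table(4):
        print(", ".join(labels), "|", " + ".join(map(str, capacities)), "=", total)
```

The totals come out as 2, 8, 18, 32, i.e. 2n² for the shell with principal quantum number n.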

It is not the purpose of this book to report on the extensive 
empirical data of spectroscopy, nor to show how the two main 
principles required to lead beyond the general scheme of quantum
mechanics to the interpretation of spectra were wrested from
this material; I here refer to the introduction of the inner
quantum number $j$ in addition to the azimuthal $l$, or the spinning
electron, on the one hand, and to the reduction of $\mathfrak{R}^f$ to $\{\mathfrak{R}^f\}$
by means of the Pauli exclusion principle on the other.
Millikan begins his report to the American Philosophical Society
on “Recent Developments in Spectroscopy” [Proc. Am. Phil.
Soc. 66, p. 211 (1927)] with the words: “Never in the history
of science has a subject sprung so suddenly from a state of complete
obscurity and unintelligibility to a condition of full illumination
and predictability as has the field of spectroscopy
since the year 1913.” The theory of groups offers the appropriate
mathematical tool for the description of the order
thus won. 

The lines of the optical spectrum are caused by quantum 
jumps of the electrons which are most loosely bound. In the 
alkalies Li, Na, K, ... the one involved is accordingly in the
state 2s, 3s, 4s, .... We also understand why their cores
Li⁺, Na⁺, K⁺, ... are spherically symmetric, and therefore
why their spectra may be approximately calculated in terms
of the motion of an electron in a spherically symmetric field;
the real reason behind this is the following. That an electron
has the quantum numbers $n$, $l$ means that its state is in a
sub-space $\mathfrak{R}(nl)$ of $A = 2(2l + 1)$ dimensions. The sub-space
$\{\mathfrak{R}(nl) \times \mathfrak{R}(nl) \times \cdots \times \mathfrak{R}(nl)\}$ with $A$ factors, as obtained by the
anti-symmetric reduction of the $A$-fold product, is 1-dimensional and the
rotation group induces in it the 1-dimensional identical representation;
i.e. a shell consisting of $A$ electrons in the state $n$, $l$ acts
spherical-symmetrically; its presence does not increase the manifold of
terms. Hence the “closedness” of those elements with which
a shell is completed; the rare gases, which precede the alkalies,
are elements of this kind. But we should also expect Cu, Ag, Au
to have alkali-like spectra, as they contain but a single electron
in the s state, while all the others are bound more tightly in
a “closed” configuration with an external field which is spherically
symmetric. The valence of the elements must obviously
find its explanation in these terms ; indeed, it gave the clues 
which originally led to the discovery of the periodic table. 
But only in recent times have we been able to call on the assist- 
ance of spectra, interpreted and arranged with the aid of atomic 
theory by Bohr and others, and they have verified the principal 
features of the table, while modifying, supplementing and 
improving its details. 
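
The statement that a completed shell contributes a single state may be illustrated by direct enumeration. The sketch below is not part of the original text: the helper names are hypothetical, and it labels the one-electron states of the sub-space by the pairs (m_l, m_s), neglecting the spin perturbation. That the total projections vanish is a necessary condition for the identical representation and is consistent with, though of course no proof of, the spherical symmetry asserted above.

```python
from fractions import Fraction
from itertools import combinations


def one_electron_states(l: int):
    """All pairs (m_l, m_s) spanning the one-electron sub-space for fixed n, l."""
    half = Fraction(1, 2)
    return [(m, s) for m in range(-l, l + 1) for s in (half, -half)]


def occupations(l: int, k: int):
    """Basis of the anti-symmetric space for k electrons: k-element subsets."""
    return list(combinations(one_electron_states(l), k))


def total_projection(occupation):
    """Total (M_L, M_S) carried by one occupation."""
    return (sum(m for m, _ in occupation), sum(s for _, s in occupation))


if __name__ == "__main__":
    for l in range(4):
        closed = occupations(l, 2 * (2 * l + 1))
        print(l, len(closed), total_projection(closed[0]))
```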

The consequences of the Pauli principle for the term analysis 
of atomic spectra will be discussed in detail in Chapter V,
particularly in § 15. We here mention briefly the results for
the case of 2-electron spectra, $f = 2$.

Just as the alkalies may be treated as if they were but 
1-electron atoms, in dealing with the alkaline earth metals we 
need only take into account the two most loosely bound electrons 
which occupy an s orbit outside a spherically symmetric closed
shell. As before, we obtain one singlet and one triplet term

$$(nl,\ n'l';\ L)$$

whose total azimuthal quantum number $L$ assumes the values

$$L = l + l',\ l + l' - 1,\ \cdots,\ |l - l'|$$

assuming that the two quantum states $(nl)$, $(n'l')$ of the individual
electrons are distinct. The only difference is that now such
a term appears only once, whereas before it appeared twice,
corresponding to a permutation of the electrons. The situation
is, however, more complicated if $(nl) = (n'l')$. The only singlet
terms

$$(nl;\ nl;\ L)$$

which actually occur are those with even $L = 0, 2, \cdots, 2l$ and the
only triplet terms are those with odd $L = 1, 3, \cdots, 2l - 1$. This
rule is thoroughly in accord with the empirical data. 
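
A simple consistency check on this rule is to count states. The sketch below is illustrative and not part of the original text; the function names are hypothetical. It assumes that a term of multiplicity 1 or 3 and total azimuthal quantum number L comprises (multiplicity)·(2L + 1) states, and compares the total with the number of anti-symmetric two-electron states that can be formed in a sub-space of 2(2l + 1) dimensions.

```python
from math import comb


def equivalent_pair_terms(l: int):
    """Terms (multiplicity, L) for two equivalent electrons with the same n, l."""
    singlets = [(1, L) for L in range(0, 2 * l + 1, 2)]  # even L = 0, 2, ..., 2l
    triplets = [(3, L) for L in range(1, 2 * l, 2)]      # odd L = 1, 3, ..., 2l - 1
    return singlets + triplets


def count_states(terms) -> int:
    """Number of states contained in a list of terms."""
    return sum(multiplicity * (2 * L + 1) for multiplicity, L in terms)


if __name__ == "__main__":
    for l in range(4):
        print(l, count_states(equivalent_pair_terms(l)), comb(2 * (2 * l + 1), 2))
```

For l = 1, for instance, the singlets with L = 0, 2 and the triplet with L = 1 give 1 + 5 + 9 = 15 states, which is the number of ways of choosing 2 out of 6 one-electron states.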

The best-known lines of the spectra are those arising from 
transitions in which only one electron is not in the normal state 
and is jumping between higher energy levels. Hence if one
of the two electrons (not saying which!) is in the normal state
$n' = n_0$, $l' = 0$ ($n_0 = 1, 2, 3, 4, \cdots$ for He, Be, Mg, Ca, ...)
we have $L = l$ and the two quantum numbers $(n, l)$ suffice to
determine the singlets or triplets. The lowest S term ($L = 0$)
of the singlet system has the principal quantum number $n = n_0$,
but there is no such term in the triplet system; it begins with
$n = n_0 + 1$. We find that the lowest S term in such a triplet
system (which is, as we know, simple), e.g. in the spectrum of
Mg, actually does lie in the neighbourhood of the second lowest 
S term of the singlet system instead of the lowest. 

§ 11. The Problem of Several Bodies and the Quantization
of the Wave Equation

In this paragraph we depart from our usual terminology 
and denote the number of individuals by $n$ instead of $f$. We
first consider more fully the reduction of $\mathfrak{R}^n$ to $[\mathfrak{R}^n]$, for we shall
find that although it does not apply to electrons, it does to
photons. Let $H = \|H_{\alpha\beta}\|$ be the Hamiltonian function of an
individual. The variables $\psi(n_1, n_2, \cdots)$ of the unitary space
$[\mathfrak{R}^n]$ behave like the monomials

$$\frac{x_1^{n_1} x_2^{n_2} \cdots}{\sqrt{n_1!\, n_2! \cdots}} \qquad (n_1 + n_2 + \cdots = n), \tag{11.1}$$

of degree $n$ which are formed from the components $x_\alpha$ of an
arbitrary vector in $\mathfrak{R}$; we denote this monomial (11.1), without
the denominator, by $\varphi(n_1, n_2, \cdots)$. We shall have occasion
to use the differentiation formula

$$d(x_1^{n_1} x_2^{n_2} \cdots) = n_1 (x_1^{n_1 - 1} x_2^{n_2} \cdots dx_1) + n_2 (x_1^{n_1} x_2^{n_2 - 1} \cdots dx_2) + \cdots.$$

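In the absence of interaction between the individuals we obtain
from

$$\frac{1}{i}\frac{dx_\alpha}{dt} + \sum_\beta H_{\alpha\beta}\, x_\beta = 0 \tag{11.2}$$

the equation

$$-\frac{1}{i}\frac{d\varphi(n_1, n_2, \cdots)}{dt} = n_1 \sum_\beta H_{1\beta}\, \varphi(n_1 - 1, n_2, \cdots, n_\beta + 1, \cdots) + n_2 \sum_\beta H_{2\beta}\, \varphi(n_1, n_2 - 1, \cdots, n_\beta + 1, \cdots) + \cdots.$$

In the sum on the right $\varphi(n_1 - 1, n_2, \cdots, n_\beta + 1, \cdots)$ is to
be interpreted as $\varphi(n_1, n_2, \cdots)$ for $\beta = 1$; similarly for the
term with $\beta = 2$, etc. We can also write this equation

$$-\frac{1}{i}\frac{d\varphi(n_1, n_2, \cdots)}{dt} = \sum_\alpha n_\alpha H_{\alpha\alpha} \cdot \varphi(n_1, n_2, \cdots) + \sum_{\alpha \neq \beta} n_\alpha H_{\alpha\beta}\, \varphi(\cdots, n_\alpha - 1, \cdots, n_\beta + 1, \cdots).$$

On introducing the binomial coefficients in accordance with
(11.1) we obtain as the equations of motion

$$-\frac{1}{i}\frac{d\psi(n_1, n_2, \cdots)}{dt} = \sum_\alpha n_\alpha H_{\alpha\alpha} \cdot \psi(n_1, n_2, \cdots) + \sum_{\alpha \neq \beta} \sqrt{n_\alpha (n_\beta + 1)}\, H_{\alpha\beta}\, \psi(\cdots, n_\alpha - 1, \cdots, n_\beta + 1, \cdots). \tag{11.3}$$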


These equations are of the form

$$\frac{1}{i}\frac{d\psi}{dt} + H\psi = 0, \qquad H = \sum_{\alpha,\beta} H_{\alpha\beta}\, \eta_{\alpha\beta}, \tag{11.4}$$

where the matrices $\eta_{\alpha\beta}$ are defined by

$$\eta_{\alpha\alpha}(n_1, n_2, \cdots;\, n'_1, n'_2, \cdots) = \begin{cases} n_\alpha & \text{if } n'_1 = n_1,\ n'_2 = n_2,\ \cdots \\ 0 & \text{otherwise} \end{cases} \tag{11.5}$$

and for $\alpha \neq \beta$

$$\eta_{\alpha\beta}(n_1, n_2, \cdots;\, n'_1, n'_2, \cdots) = \begin{cases} \sqrt{n_\alpha (n_\beta + 1)} \\ 0 \end{cases} \tag{11.5'}$$

where the first alternative holds when all $n' = n$ with the exception
of $n'_\alpha = n_\alpha - 1$, $n'_\beta = n_\beta + 1$ and the second in
all other cases. $H$ is, as it should be, an Hermitian matrix.
If $H$ is in diagonal form the fundamental vectors forming our
co-ordinate system are the quantum states of the various individuals;
$|\psi(n_1, n_2, \cdots)|^2$ is then the probability that there
are simultaneously $n_1$ individuals in the first quantum state,
$n_2$ in the second, etc. On reduction from $\mathfrak{R}^n$ to $[\mathfrak{R}^n]$ it becomes
impossible to identify the individuals as Mike, Ike, ... and we
therefore may not ask for the probability that Mike is in the
first state, Ike is in the second, .... If we have in addition to $H$ a
perturbation $\epsilon W$ affecting the individuals (and symmetric with
respect to these individuals), then equation (11.3) governs the
change of the probabilities $|\psi(n_1, n_2, \cdots)|^2$ in time.
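
The structure of the matrices η can be made concrete for a small number of states. The sketch below is an illustration and not part of the original text: the function names are hypothetical, the single-individual matrix `h` is taken real and symmetric purely for simplicity, and the elements are built directly from the rule that the entry is n_α on the diagonal and √(n_α(n_β + 1)) when the primed occupation numbers differ by n'_α = n_α - 1, n'_β = n_β + 1.

```python
from itertools import product
from math import sqrt


def occupation_basis(modes: int, n: int):
    """All occupation tuples (n_1, ..., n_modes) with n_1 + ... + n_modes = n."""
    return [occ for occ in product(range(n + 1), repeat=modes) if sum(occ) == n]


def eta(alpha: int, beta: int, occ, occ_prime) -> float:
    """Matrix element eta_{alpha beta}(occ; occ_prime), indices counted from 0."""
    if alpha == beta:
        return float(occ[alpha]) if occ == occ_prime else 0.0
    target = list(occ)
    target[alpha] -= 1  # n'_alpha = n_alpha - 1
    target[beta] += 1   # n'_beta  = n_beta + 1
    if tuple(target) == occ_prime:
        return sqrt(occ[alpha] * (occ[beta] + 1))
    return 0.0


def many_body_hamiltonian(h, n: int):
    """Matrix sum_{alpha,beta} h[alpha][beta] * eta_{alpha beta} on the n-sector."""
    modes = len(h)
    basis = occupation_basis(modes, n)
    matrix = [[sum(h[a][b] * eta(a, b, occ, occ_p)
                   for a in range(modes) for b in range(modes))
               for occ_p in basis] for occ in basis]
    return basis, matrix


if __name__ == "__main__":
    basis, matrix = many_body_hamiltonian([[1.0, 0.5], [0.5, 2.0]], 3)
    for occ, row in zip(basis, matrix):
        print(occ, [round(value, 4) for value in row])
```

For a diagonal `h` the resulting matrix is diagonal with entries n_1 E_1 + n_2 E_2 + ..., and for a symmetric `h` the printed matrix is symmetric, in agreement with the remark that H is Hermitian.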

The Hamiltonian function H reminds us of the one which we 
obtained in Chapter II, § 13 by quantizing Maxwell’s equations ; 
there the individuals were photons. Maxwell’s equations are 
to be considered as the quantum-theoretical wave equations of 
an individual photon. If we replace the photon by an individual 
whose state $(x_\alpha)$ varies in accordance with equation (11.2) we
are led to a new way of treating the problem of several bodies,
which we call the “method of second quantization” in contrast
to the “method of composition” or “×-multiplication” developed
in Chapter II, § 10. In this we consider (11.2) as the
classical equations of motion of a physical system whose canonical
variables are the real and imaginary parts of the $x_\alpha$ and as
such subject them to the process of quantization. We here
tie on to the development given in Chapter II, § 11. Introduce
the complex quantities

$$x_\alpha = \frac{1}{\sqrt{2}}(q_\alpha + i p_\alpha), \qquad \bar{x}_\alpha = \frac{1}{\sqrt{2}}(q_\alpha - i p_\alpha)$$

into the Hamiltonian function $H$ as independent variables in
place of $q_\alpha$, $p_\alpha$; the Hamiltonian equations are then

$$\frac{dx_\alpha}{dt} = -i\,\frac{\partial H}{\partial \bar{x}_\alpha}, \qquad \frac{d\bar{x}_\alpha}{dt} = i\,\frac{\partial H}{\partial x_\alpha}. \tag{11.6}$$

In order that (11.2) may be considered as the classical equations
of motion of a system with infinitely many degrees of freedom,
in accordance with our programme, they must be of the form
(11.6). But this is in fact the case; the Hamiltonian function
is then

$$H = \sum_{\alpha,\beta} H_{\alpha\beta}\, \bar{x}_\alpha x_\beta.$$

In quantizing, $x_\alpha$, $\bar{x}_\alpha$ are to be replaced by Hermitian conjugate
matrices $x_\alpha$, $\bar{x}_\alpha$ which satisfy the following commutation rules:

$$x_\alpha x_\beta - x_\beta x_\alpha = 0, \qquad x_\alpha \bar{x}_\beta - \bar{x}_\beta x_\alpha = \delta_{\alpha\beta}. \tag{11.7}$$

The Hamiltonian function $H$ then becomes the matrix

$$H = \sum_{\alpha,\beta} H_{\alpha\beta}\, \bar{x}_\alpha x_\beta; \tag{11.8}$$

if $H$ is in diagonal form then

$$H = \sum_\alpha E_\alpha\, \bar{x}_\alpha x_\alpha.$$

We are here dealing with an infinite set of oscillators, the individual
members of which are distinguished by the index $\alpha$;
the energy of the $\alpha$-th is given in terms of the complex co-ordinates
$x_\alpha$, $\bar{x}_\alpha$ by $E_\alpha \bar{x}_\alpha x_\alpha$.

The quantum theory of a single oscillator as developed in
II, § 3 gives us as the irreducible solution of

$$x\bar{x} - \bar{x}x = 1,$$

where $x$, $\bar{x}$ are two Hermitian conjugate matrices normalized
in such a way that the energy $\bar{x}x$ is in diagonal form, the matrices

$$x(n, n + 1) = \sqrt{n + 1}, \qquad \bar{x}(n, n - 1) = \sqrt{n}; \qquad \bar{x}x(n, n) = n,$$

all other components vanishing; the quantum number $n$ assumes
the values 0, 1, 2, .... From this we obtain the solution of
(11.7) by composition:

$$x_\alpha(n_1, n_2, \cdots;\, n'_1, n'_2, \cdots) = \begin{cases} \sqrt{n_\alpha + 1} & \text{if all } n' = n \text{ except } n'_\alpha = n_\alpha + 1, \\ 0 & \text{otherwise;} \end{cases}$$

$$\bar{x}_\alpha(n_1, n_2, \cdots;\, n'_1, n'_2, \cdots) = \begin{cases} \sqrt{n_\alpha} & \text{if all } n' = n \text{ except } n'_\alpha = n_\alpha - 1, \\ 0 & \text{otherwise.} \end{cases}$$

The products $\bar{x}_\alpha x_\alpha$ are of course in diagonal form; $\bar{x}_\alpha x_\beta$ is the
matrix $\eta_{\alpha\beta}$ introduced above, and (11.8) coincides with (11.4):
the method of second quantization leads to the same result as the
method of composition supplemented by the “symmetric reduction”
to $[\mathfrak{R}^n]$. But now the number

$$n_1 + n_2 + \cdots = n$$

of individuals is not prescribed; however $H$ is reduced into
sub-matrices in accordance with the various values of $n$, for
all components $H(n_1 n_2 \cdots;\, n'_1 n'_2 \cdots)$ for which
$n_1 + n_2 + \cdots \neq n'_1 + n'_2 + \cdots$ vanish. The total number of photons
is not conserved, and to this extent Maxwell’s equations do not
fit completely into the quantum-theoretical picture, unless we
wish to consider “non-existence” as a particular quantum
state of the photon.
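
The oscillator matrices obtained by composition lend themselves to a direct check. The sketch below is illustrative and not part of the original text: the function names are hypothetical, vectors are stored as dictionaries from occupation tuples to amplitudes so that no truncation of the quantum numbers is needed, and the matrices are applied in the convention in which x(n, n + 1) = √(n + 1) lowers and x̄(n, n - 1) = √n raises the occupation of a basis vector.

```python
from math import sqrt


def lower(alpha: int, state: dict) -> dict:
    """Apply x_alpha to a vector {occupation tuple: amplitude}."""
    out = {}
    for occ, amp in state.items():
        if occ[alpha] > 0:
            new = occ[:alpha] + (occ[alpha] - 1,) + occ[alpha + 1:]
            out[new] = out.get(new, 0.0) + sqrt(occ[alpha]) * amp
    return out


def raise_(alpha: int, state: dict) -> dict:
    """Apply the Hermitian conjugate matrix xbar_alpha to a vector."""
    out = {}
    for occ, amp in state.items():
        new = occ[:alpha] + (occ[alpha] + 1,) + occ[alpha + 1:]
        out[new] = out.get(new, 0.0) + sqrt(occ[alpha] + 1) * amp
    return out


def commutator_on(alpha: int, beta: int, occ: tuple) -> dict:
    """(x_alpha xbar_beta - xbar_beta x_alpha) applied to the basis vector occ."""
    start = {occ: 1.0}
    first = lower(alpha, raise_(beta, start))
    second = raise_(beta, lower(alpha, start))
    keys = set(first) | set(second)
    return {key: first.get(key, 0.0) - second.get(key, 0.0) for key in keys}


if __name__ == "__main__":
    print(commutator_on(0, 0, (2, 1, 0)))           # the basis vector itself
    print(commutator_on(0, 1, (2, 1, 0)))           # vanishes
    print(raise_(0, lower(1, {(1, 2, 0): 1.0})))    # element of xbar_0 x_1
```

The last line reproduces the element √(n_α(n_β + 1)) of η for n = (2, 1, 0), α = 0, β = 1, namely 2.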

The method of composition remains applicable in the presence 
of interaction between the individuals, provided it is an in- 
stantaneous action at a distance determined by the simultaneous 
values of the canonical variables of the various individuals. 
But it breaks down when, as in the theory of relativity, account 
is taken of the finite velocity of propagation, which led to the 
introduction of continuous fields in the classical theories. The 
difficulty arises from the fact that the wave function $\psi$ must
contain the one time $t$ as argument in addition to the spatial
co-ordinates of each particle, whereas the theory of relativity
requires that the proper time of each particle appear as argument
in $\psi$ as well as the spatial co-ordinates. The method of
second quantization shows its superiority in dealing with such 
problems. 

As we have seen, the method of second quantization in
accordance with Heisenberg’s commutation rules is equivalent
to a reduction of the system space to $[\mathfrak{R}^n]$. Since we have
seen in II, § 13 that this leads to the correct laws of radiation
phenomena, we must conclude that the behaviour of photons
corresponds to this reduction. But in the case of electrons the
reduction is to the space $\{\mathfrak{R}^n\}$, and we must now investigate
to what kind of quantization this corresponds. The vectors
of the unitary space $\{\mathfrak{R}^n\}$ are the anti-symmetric tensors with
components

$$x\{\alpha_1, \alpha_2, \cdots, \alpha_n\} = |x_{\alpha_1}, x_{\alpha_2}, \cdots, x_{\alpha_n}| \tag{11.9}$$

in the space $\mathfrak{R}$, where the one row in the determinant stands for
the $n$ rows formed in the same manner from $n$ vectors
$\mathfrak{x} = \mathfrak{x}^{(1)}, \mathfrak{x}^{(2)}, \cdots, \mathfrak{x}^{(n)}$ of $\mathfrak{R}$. We can obtain the totality of linearly
independent components by restricting the indices by the
condition

$$\alpha_1 < \alpha_2 < \cdots < \alpha_n. \tag{11.10}$$

We now denote (11.9) by $\psi(n_1, n_2, \cdots)$, where $n_\alpha = 1$ or 0
according to whether $\alpha$ appears in the set of indices $\alpha_1, \alpha_2, \cdots, \alpha_n$
or not; these quantum numbers $n_\alpha$ may thus only assume
one of two values. On replacing $\alpha_1 = \alpha$ in (11.9) by an index
$\beta \neq \alpha$, (11.9) vanishes if $\beta$ is equal to one of the remaining
indices $\alpha_2, \cdots, \alpha_n$; if $\beta$ is different from $\alpha_2, \cdots, \alpha_n$ it becomes

$$x\{\beta, \alpha_2, \cdots, \alpha_n\} = \pm\,\psi(n_1, \cdots, n_\alpha - 1, \cdots, n_\beta + 1, \cdots),$$

the sign $\pm 1$ being $(-1)^r$ where $r$ is the number of indices in
the set $\alpha_2, \cdots, \alpha_n$ lying between $\alpha$ and $\beta$:

$$r = \sum_\lambda n_\lambda,$$

where the sum is extended over all indices $\lambda$ between $\alpha$ and $\beta$.
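We again obtain equations of the form (11.4); (11.5) is then
valid as it stands but (11.5') is to be replaced by

$$\eta_{\alpha\beta}(n_1, n_2, \cdots;\, n'_1, n'_2, \cdots) = \pm 1 \text{ or } 0,$$

where the first alternative applies to the case in which all $n' = n$
except $n_\alpha = 1$, $n'_\alpha = 0$; $n_\beta = 0$, $n'_\beta = 1$, the sign being again
determined in accordance with the above rule. On writing
a matrix $\|a(n\, n')\|$ in the form

$$\begin{Vmatrix} a(0\,0) & a(0\,1) \\ a(1\,0) & a(1\,1) \end{Vmatrix}$$

and introducing the abbreviations

$$\begin{Vmatrix} 1 & 0 \\ 0 & 1 \end{Vmatrix} = 1, \qquad \begin{Vmatrix} 1 & 0 \\ 0 & -1 \end{Vmatrix} = 1',$$

we may write

$$\eta_{\alpha\alpha} = 1 \times 1 \times \cdots \times \begin{Vmatrix} 0 & 0 \\ 0 & 1 \end{Vmatrix} \times 1 \times 1 \times \cdots,$$

$$\eta_{\alpha\beta} = 1 \times 1 \times \cdots \times \begin{Vmatrix} 0 & 0 \\ 1 & 0 \end{Vmatrix} \times 1' \times \cdots \times 1' \times \begin{Vmatrix} 0 & 1 \\ 0 & 0 \end{Vmatrix} \times 1 \times \cdots \quad (\alpha \neq \beta),$$

where the matrix that is written explicitly in the first equation
is in the $\alpha$-th place and those in the second in the $\alpha$-th and $\beta$-th
places respectively. We must now attempt to write these
matrices in the form $\bar{x}_\alpha x_\beta$; this can in fact be accomplished by
taking

$$x_\alpha = 1' \times 1' \times \cdots \times 1' \times \begin{Vmatrix} 0 & 1 \\ 0 & 0 \end{Vmatrix} \times 1 \times 1 \times \cdots, \qquad \bar{x}_\alpha = 1' \times 1' \times \cdots \times 1' \times \begin{Vmatrix} 0 & 0 \\ 1 & 0 \end{Vmatrix} \times 1 \times 1 \times \cdots, \tag{11.11}$$

the small explicit matrices being in the $\alpha$-th place. $x_\alpha$, $\bar{x}_\alpha$ are
Hermitian conjugates, and $H$ can now be written in terms of
them in the desired form (11.8). Instead of the commutation
rules (11.7) we now have

$$x_\alpha x_\beta + x_\beta x_\alpha = 0, \qquad \bar{x}_\alpha \bar{x}_\beta + \bar{x}_\beta \bar{x}_\alpha = 0, \qquad x_\alpha \bar{x}_\beta + \bar{x}_\beta x_\alpha = \delta_{\alpha\beta}. \tag{11.12}$$

(11.11) is the irreducible solution of these equations by a pair of
Hermitian conjugate matrices $x_\alpha$, $\bar{x}_\alpha$ which are so normalized
that $\bar{x}_\alpha x_\alpha$ is a diagonal matrix.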
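
The rules with the plus sign can be verified for a small number of states by forming the ×-products explicitly. The sketch below is an illustration and not part of the original text: the helper names are hypothetical, the places are counted from 0, and it assumes the construction in which each matrix carries the factor 1' = diag(1, -1) in every place before its own, the explicit 2 × 2 matrix in its own place, and the unit matrix afterwards.

```python
def kron(a, b):
    """Kronecker product of two square matrices given as lists of lists."""
    return [[a[i][j] * b[k][l] for j in range(len(a)) for l in range(len(b))]
            for i in range(len(a)) for k in range(len(b))]


def matmul(a, b):
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def add(a, b):
    return [[p + q for p, q in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


ONE = [[1, 0], [0, 1]]          # the abbreviation 1
ONE_PRIME = [[1, 0], [0, -1]]   # the abbreviation 1'
UPPER = [[0, 1], [0, 0]]        # explicit factor of x_alpha
LOWER = [[0, 0], [1, 0]]        # explicit factor of xbar_alpha


def chain(factors):
    """Kronecker product of a list of 2 x 2 factors, first factor leftmost."""
    result = [[1]]
    for factor in factors:
        result = kron(result, factor)
    return result


def x(alpha: int, modes: int):
    """x_alpha = 1' x ... x 1' x UPPER x 1 x ... x 1."""
    return chain([ONE_PRIME] * alpha + [UPPER] + [ONE] * (modes - alpha - 1))


def xbar(alpha: int, modes: int):
    """xbar_alpha = 1' x ... x 1' x LOWER x 1 x ... x 1."""
    return chain([ONE_PRIME] * alpha + [LOWER] + [ONE] * (modes - alpha - 1))


if __name__ == "__main__":
    modes = 3
    number = matmul(xbar(1, modes), x(1, modes))
    print([number[i][i] for i in range(2 ** modes)])  # occupation of place 1
```

The factors 1' supply exactly the sign (-1)^r of the determinant rule, and the product of the conjugate matrix with the matrix itself is the diagonal matrix of the quantum number n_α.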

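In order to show that the equations (11.4) for the vector $\psi$
in system-space yield the Hamiltonian equations (11.6) for the
forms

$$x_\alpha = \sum_{n, n'} x_\alpha(n; n')\, \bar\psi(n)\, \psi(n') \quad \text{and} \quad \bar{x}_\alpha,$$

we must prove that the formula

$$x_\alpha H - H x_\alpha = \frac{\partial H}{\partial \bar{x}_\alpha}$$

employed in II, § 11, holds here as well. We find that it does
not hold for an arbitrary polynomial $H$ in $x_\alpha$, $\bar{x}_\alpha$, but that it
does for even polynomials in general and so in particular for
the Hermitian form (11.8). For we have, for example,

$$x_1 \bar{x}_\alpha x_\beta = \delta_{1\alpha}\, x_\beta - \bar{x}_\alpha x_1 x_\beta = \delta_{1\alpha}\, x_\beta + \bar{x}_\alpha x_\beta x_1,$$

whence

$$x_1 \cdot \bar{x}_\alpha x_\beta - \bar{x}_\alpha x_\beta \cdot x_1 = \delta_{1\alpha}\, x_\beta, \qquad x_1 H - H x_1 = \sum_\beta H_{1\beta}\, x_\beta.$$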

On introducing real quantities, i.e. Hermitian forms, $p_\alpha$, $q_\alpha$
by

$$x_\alpha = \tfrac{1}{2}(q_\alpha + i p_\alpha), \qquad \bar{x}_\alpha = \tfrac{1}{2}(q_\alpha - i p_\alpha)$$

and denoting the set $p_1, q_1$; $p_2, q_2$; ... straight through by
$p_1, p_2, p_3, p_4, \cdots$ we obtain the relations

$$p_\alpha^2 = 1, \qquad p_\alpha p_\beta + p_\beta p_\alpha = 0 \quad (\alpha \neq \beta). \tag{11.13}$$

The $p_\alpha$ are not only Hermitian but unitary as well, as can be
seen from the first of these equations or directly. Here again we
meet the matrices

$$\begin{Vmatrix} 0 & 1 \\ 1 & 0 \end{Vmatrix}, \qquad \begin{Vmatrix} 0 & -i \\ i & 0 \end{Vmatrix}$$

which occurred in connection with the spinning electron.

We have thus discovered the correct way to quantize the
field equations defining electron waves and matter waves. 
Here again we find, as in the case of the spinning electron, that 
quantum kinematics is not to be restricted by the assumption 
of Heisenberg’s specialized commutation rules. 
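
For a single state the relations (11.13) can be checked at sight. The sketch below is illustrative and not part of the original text: the names are hypothetical, and it assumes the one-state matrices x with the single non-vanishing component in the upper right and its conjugate with the component in the lower left, together with the inversion q = x + x̄, p = (x - x̄)/i of the definitions above.

```python
def matmul(a, b):
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def dagger(a):
    """Hermitian conjugate (transpose and complex conjugate)."""
    size = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(size)] for i in range(size)]


X = [[0, 1], [0, 0]]       # one-state x
X_BAR = [[0, 0], [1, 0]]   # its Hermitian conjugate

# Inverting x = (q + i p) / 2 and xbar = (q - i p) / 2:
Q = [[X[i][j] + X_BAR[i][j] for j in range(2)] for i in range(2)]
P = [[-1j * (X[i][j] - X_BAR[i][j]) for j in range(2)] for i in range(2)]

if __name__ == "__main__":
    print(Q)                # the matrix with rows (0, 1), (1, 0)
    print(P)                # the matrix with rows (0, -i), (i, 0)
    print(matmul(P, P))     # the unit matrix
```

Both matrices square to the unit matrix and anti-commute with one another, so that each is at once Hermitian and unitary.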


§ 12. Quantization of the Maxwell-Dirac Field 
Equations 

The field laws arise from a Hamiltonian principle which is 
analogous to the Hamiltonian principle of classical mechanics. 
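This latter is expressed in terms of a Lagrangian function $L$
which depends on the positional co-ordinates $q_i$ and their derivatives
$\dot{q}_i$ with respect to time, and asserts that the first
variation of

$$\int L(q_i, \dot{q}_i)\, dt \tag{12.1}$$

vanishes when the $q_i$ are assigned arbitrary infinitesimal increments
$\delta q_i$ which vanish outside a certain finite time interval.
This principle yields, on integration by parts, the differential
equations

$$\frac{dp_i}{dt} + \frac{\partial L}{\partial q_i} = 0, \qquad p_i = -\frac{\partial L}{\partial \dot{q}_i}. \tag{12.2}$$

Defining

$$H = L + \sum_i p_i \dot{q}_i$$

and noting that

$$\delta L = \sum_i \frac{\partial L}{\partial q_i}\, \delta q_i - \sum_i p_i\, \delta \dot{q}_i$$

we obtain for the differential of $H$ the expression

$$\delta H = \sum_i \frac{\partial L}{\partial q_i}\, \delta q_i + \sum_i \dot{q}_i\, \delta p_i.$$

Expressing $H$ as a function of the $q_i$ and the generalized momenta
$p_i$ associated with them, we have

$$\frac{\partial H}{\partial q_i} = \frac{\partial L}{\partial q_i}, \qquad \frac{\partial H}{\partial p_i} = \dot{q}_i,$$

and by (12.2) these are just the Hamiltonian canonical equations

$$\frac{dq_i}{dt} = \frac{\partial H}{\partial p_i}, \qquad \frac{dp_i}{dt} = -\frac{\partial H}{\partial q_i}.$$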

In quantum theory the $q_i$, $p_i$ are operators satisfying Heisenberg’s
commutation rules.
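
As a check on the sign conventions, one may work through a single harmonic oscillator. The following is an illustrative sketch and not part of the original text; the mass $m$, the force constant $k$ and the particular choice of $L$ are assumptions made only for this example. It assumes the conventions $p_i = -\partial L/\partial \dot{q}_i$ and $H = L + \sum_i p_i \dot{q}_i$, under which $L$ is the negative of the Lagrangian function customary in mechanics.

```latex
\begin{align*}
L &= \tfrac{1}{2} k q^2 - \tfrac{1}{2} m \dot{q}^2, &
p &= -\frac{\partial L}{\partial \dot{q}} = m \dot{q}, \\
H &= L + p\,\dot{q} = \frac{p^2}{2m} + \tfrac{1}{2} k q^2, \\
\frac{dq}{dt} &= \frac{\partial H}{\partial p} = \frac{p}{m}, &
\frac{dp}{dt} &= -\frac{\partial H}{\partial q} = -k q .
\end{align*}
```

The last equation agrees with $dp/dt + \partial L/\partial q = 0$, and $H$ is the total energy with the customary sign.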
This reasoning can be carried over without difficulty to the
case of a continuum, as appears in field theories. On replacing
for the moment the 3-dimensional space by the 1-dimensional
interval $0 \le x \le 1$ described by the co-ordinate $x$ and assuming,
for the sake of simplicity, that only one state function $q = q(x, t)$
is involved, the integral (12.1) is then to be replaced by

$$\int\!\!\int_0^1 L(q, \dot{q})\, dx\, dt.$$

Naturally $L$ may depend on the spatial derivative $\partial q/\partial x$, or even
higher derivatives, in addition to $q$. The continuous variable
$x$ takes the place of the index $i$ and the Lagrangian function, in
the sense of (12.1), is now the integral $\int_0^1 L(q, \dot{q})\, dx$ with respect to
the spatial variable instead of $L$ itself. We first replace the
continuum by a discrete set of equidistant points defined by
$x = i/n$ $(i = 0, 1, \cdots, n - 1)$. The differential quotients with
respect to $x$ are naturally to be replaced by difference quotients
with the difference $\Delta x = 1/n$, and the integrals become sums.
In accordance with the outline above we must now set

$$p_i = -\frac{\partial L(q, \dot{q})}{\partial \dot{q}}\, \Delta x,$$

calculated at the point $x = i/n$. For the continuum we have
analogously to set

$$p = -\frac{\partial L(q, \dot{q})}{\partial \dot{q}}$$

and $H$ is to be defined by

$$H = \int_0^1 (L + \dot{q}\, p)\, dx.$$

The commutation rules which are satisfied by $q$, $p$ in quantum
mechanics cause some trouble. As long as we employ the
discrete set of points in place of the continuum they are

$$q(x)\, p(x') - p(x')\, q(x) = i \cdot \frac{\delta_{xx'}}{\Delta x},$$

where $x$, $x'$ run independently through the set $i/n$ and $\delta_{xx'}$ is
1 or 0 according as $x'$ coincides with $x$ or not. For fixed $x'$

$$\frac{1}{\Delta x} \cdot \delta_{xx'} = \delta(x - x')$$

is a function of $x$ which vanishes for all values of the argument
other than $x'$ and is there so large that the sum $\sum_x \delta(x - x') \cdot \Delta x$
has the value 1. In dealing with the continuum we therefore
introduce with Dirac a function $\delta(x - x')$ which vanishes at
all points $x \neq x'$ and is so large at the point $x'$ that its integral
has the value 1 (cf. I, § 7). Of course there exists no such
function, but it can be “arbitrarily closely approximated” by
a function which vanishes everywhere except in a very small
interval about $x'$ and assumes very large values within this
interval. Only in this sense can we perform the passage to
the limit $\Delta x \to 0$ and write the commutation rules symbolically
in the form

$$q(x)\, p(x') - p(x')\, q(x) = i\, \delta(x - x'). \tag{12.3}$$
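
The behaviour of the discrete stand-in for the function δ can be seen on a small grid. The sketch below is an illustration and not part of the original text; the function names are hypothetical, and it assumes the grid x = i/n with spacing 1/n described above.

```python
def discrete_delta(n: int, j_prime: int):
    """Values of delta_{x x'} / dx on the grid x = i/n for the fixed point x' = j_prime/n."""
    dx = 1.0 / n
    return [(1.0 / dx if i == j_prime else 0.0) for i in range(n)]


def weighted_sum(values, n: int) -> float:
    """The sum over the grid of values * dx, the discrete analogue of the integral."""
    dx = 1.0 / n
    return sum(value * dx for value in values)


def sample(u, n: int, j_prime: int) -> float:
    """sum_x u(x) delta(x - x') dx, which picks out the value u(x')."""
    dx = 1.0 / n
    delta = discrete_delta(n, j_prime)
    return sum(u(i * dx) * delta[i] * dx for i in range(n))


if __name__ == "__main__":
    for n in (10, 100, 1000):
        peak = max(discrete_delta(n, 3))
        print(n, peak, weighted_sum(discrete_delta(n, 3), n))
```

The peak grows like n as the grid is refined while the weighted sum remains equal to 1, which is the sense in which the passage to the limit is to be understood.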

A good illustration of the mathematical interpretation of
this pathological function $\delta(x - x')$ arises in the theory of orthogonal
sets of functions $\varphi_i(x)$, for with its aid the completeness
condition may be formulated

$$\sum_i \bar\varphi_i(x)\, \varphi_i(x') = \delta(x - x').$$

This is literally correct as long as $x$ only runs through a discrete
set of points, but the rigorous mathematical formulation for
the case of a continuum is given by

$$\sum_i \int_0^1\!\!\int_0^1 \bar\varphi_i(x)\, \varphi_i(x') \cdot u(x)\, v(x')\, dx\, dx' = \int_0^1 u(x)\, v(x)\, dx,$$

where $u(x)$, $v(x)$ are any two continuous functions in the interval
(0, 1). Hence from the more rigorous standpoint (12.3) must
be replaced by the equation

$$\int_0^1\!\!\int_0^1 u(x)\, \{q(x)\, p(x') - p(x')\, q(x)\}\, v(x')\, dx\, dx' = i \int_0^1 u(x)\, v(x)\, dx$$

containing two arbitrary functions $u(x)$, $v(x)$; furthermore, it
is to be noted that the $p$, $q$ in the brackets are first to be replaced
by approximations $p^{(n)}$, $q^{(n)}$, e.g. by the $n$-th partial sum of
their expansion in terms of orthogonal functions, and the
passage to the limit $n \to \infty$ is to take place after, not before,
the integration. This interpretation offers a sound mathematical
method of dealing with the relation (12.3). It is to be emphasized
that (12.3) refers to two points of space $x$, $x'$ at the same moment,
i.e. in a section of the world in which $t = $ const.; the arguments
of $q$ and $p$ are to be written more precisely as $(x, t)$, $(x', t)$ respectively.
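
The rigorous form of the completeness condition, in which the summation is carried out after the integration, can be tried numerically. The sketch below is illustrative and not part of the original text: the function names, the choice of the real sine functions √2 sin(jπx) as orthogonal set on (0, 1), the midpoint rule and the two test functions are all assumptions of the example.

```python
from math import pi, sin, sqrt


def phi(j: int, x: float) -> float:
    """Orthonormal sine functions on (0, 1): phi_j(x) = sqrt(2) sin(j pi x)."""
    return sqrt(2.0) * sin(j * pi * x)


def integrate(f, points: int = 1000) -> float:
    """Midpoint rule on the interval (0, 1)."""
    h = 1.0 / points
    return h * sum(f((k + 0.5) * h) for k in range(points))


def smeared_completeness(u, v, terms: int) -> float:
    """Double integral of the truncated kernel sum_j phi_j(x) phi_j(x')
    against u(x) v(x'); it factorizes into products of single integrals."""
    total = 0.0
    for j in range(1, terms + 1):
        coefficient_u = integrate(lambda x: phi(j, x) * u(x))
        coefficient_v = integrate(lambda x: phi(j, x) * v(x))
        total += coefficient_u * coefficient_v
    return total


if __name__ == "__main__":
    u = lambda x: x * (1.0 - x)
    v = lambda x: x * (1.0 - x) ** 2
    for terms in (1, 5, 20):
        print(terms, smeared_completeness(u, v, terms))
    print("integral of u v:", integrate(lambda x: u(x) * v(x)))
```

The partial sums approach the integral of u·v, here 1/60, although the kernel itself has no limit as a function; this is the content of the remark that the passage to the limit is to take place after the integration.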
On applying this general scheme to the action

lT-M + iW' + -F, (5.18) 

from which the field equations for the electron and for the electro- 
magnetic field are obtained, we find ourselves faced with a 
difficulty arising from the fact that the Lagrangian function 
does not contain the time derivative of the scalar potential $f_0$,
for the generalized momentum associated with $f_0$ then vanishes
identically and cannot possibly satisfy a commutation relation
such as (12.3). We avoid this difficulty for the moment by
utilizing the principle of gauge invariance to remove $f_0$ from the
expression of the Lagrangian function by setting it equal to 0;
this device has already been employed in II, § 13. The set of
independent functions describing the state is then

$$\psi = (\psi_1, \psi_2, \psi_3, \psi_4), \qquad \mathfrak{f} = (f_1, f_2, f_3),$$

where we have written $\psi_3$, $\psi_4$ in place of $\psi'_1$, $\psi'_2$. The momenta
associated with these quantities are then found to be : with 

^p and — Ej, with /p. The commutation rules which are to be 
applied in quantizing the field equations are accordingly 

UP)UP')+MnUP) = K^-HP--P') k<r==l,2,3,4], (12.4') 
/p(P)£,(P')-£,(P')/p(P)==i8p,-8(P-P') [/>, ^^1,2,3], (12.4") 

where P and P' are any two points of the same spatial section 
t = const. We have here taken account of the fact that the 
quantities $\psi$ describing matter are not to satisfy Heisenberg’s
commutation rules, but are instead to satisfy those obtained
by replacing the minus sign which occurs in them by a plus
sign. These rules must be supplemented by the assertion that
the $\psi_\rho$ satisfy in addition the equations

$$\psi_\rho(P)\, \psi_\sigma(P') + \psi_\sigma(P')\, \psi_\rho(P) = 0, \tag{12.5}$$

and the same for $\bar\psi_\rho$; that the $f_p$ at any two points $P$, $P'$ are
commutative and the same for the $E_p$; and finally that the
material quantities $\psi$, $\bar\psi$ on the one hand and the electromagnetic
quantities $f_p$, $E_p$ on the other are kinematically independent,
and that every quantity of the first kind at a point P commutes 
with every quantity of the second kind at any point P' (in the 
same section t = const, of the world). 

As in II, § 13, we again consider the whole system enclosed 
in an insulated and perfectly reflecting cavity which is at rest.
In order to describe the electro-magnetic potentials we make
use of a complete orthogonal set of solutions f of 

Af + = 0 (12.6) 

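in the cavity, which satisfy the conditions

$$\operatorname{div} \mathfrak{f} = 0, \qquad \mathfrak{f} \text{ normal}$$

at the walls. The construction of such a system is readily
obtained from the Gauss divergence theorem

$$\int (\operatorname{curl} \mathfrak{f} \cdot \operatorname{curl} \mathfrak{g} + \operatorname{div} \mathfrak{f} \cdot \operatorname{div} \mathfrak{g} + \mathfrak{f} \cdot \Delta \mathfrak{g})\, dV = \int ([\mathfrak{f}, \operatorname{curl} \mathfrak{g}]_n + f_n \operatorname{div} \mathfrak{g})\, do$$

($n$ denoting normal component) for the vector
$[\mathfrak{f}, \operatorname{curl} \mathfrak{g}] + \mathfrak{f} \operatorname{div} \mathfrak{g}$, $\mathfrak{f}$ and $\mathfrak{g}$ being two arbitrary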
vector fields. We first determine the scalar functions 
which satisfy the equation A^ + ~ 0 and vanish on the 

walls, and from them construct the vector fields $\mathfrak{f}_\lambda = \operatorname{grad} \varphi_\lambda$;
these vectors automatically satisfy the conditions above, 
are of course mutually orthogonal and can be normalized in 
accordance with the equation 

f(fA . - 8,v[- 

We also determine a complete normal orthogonal system of 
solutions of (12,6) which are normal to the walls but which 
satisfy the condition div — 0 everywhere, not only at the 
walls. The are then orthogonal to these f,, and they con- 
stitute together a complete orthogonal system for vector fields 
in the cavity. We may consequently write 

f — + Epx]k\ 

ZpA.- ZqAA 

»» k } 

in the section t — const. The fy are vectorial functions of 
position in space and have as values ordinary numbers, whereas 
the p, q are scalar quantum mechanical matrices which are 
independent of position and which satisfy the commutation 
rules 

qv pv Pr qv Pk pk qk ^ > 

all q commute among themselves and all p among themselves, 
and any p commutes with any q whose index is not the same. 
[These rules are perhaps most readily obtained by solving 
(12.7) for the “Fourier coefficients” $p$, $q$ in terms of integrals
of scalar products of $\mathfrak{f}$, $\mathfrak{E}$ with $\mathfrak{f}_\nu$, $\mathfrak{f}_\lambda$, and applying the commutation
rules (12.4).] The energy

of the electro-magnetic field becomes

We already know the solution of the commutation rules which 
reduces this expression for the energy to diagonal form. The 
individual components of the vector on which the $p$, $q$ operate
are distinguished by means of the quantum numbers corre- 
sponding to the V, and the values of the continuous variables 
corresponding to the A. On setting q„ ^ 
operator which affects only the index in accordance with 
the equations 

Q.{N., iV. - 1) = a(iV., A. -f 1) = ; 

all other components, corresponding to transitions $N_\nu \to N'_\nu$
in which $N'_\nu$ is neither of $N_\nu \pm 1$, vanish. $N_\nu$ assumes the integral
values 0, 1, 2, ... and can be considered as the number of
photons of the kind $\nu$. The momentum $p_\lambda$ associated with the
continuous variable $q_\lambda$ is, following Schrödinger, represented by
the operator $\frac{1}{i}\frac{\partial}{\partial q_\lambda}$. The electro-magnetic energy is then in

diagonal form and, on neglecting the (infinite !) null-point 
energy, multiplies the vector component {N, ; qx) with 

nZqi- ( 12 . 8 ) 

•• ^ X 

We thus see how it happens that the electro-static part, which
is described by the continuous variable $q_\lambda$, is separated off from
the part due to the radiation, described by the discrete $N_\nu$
giving the number of photons of kind $\nu$.
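
As a rough numerical illustration of the diagonal form just described, the following sketch builds truncated matrices for a single radiation oscillator. It rests on assumptions that are not taken from the text: units with $\hbar = 1$, an oscillator energy $\tfrac12(p^2 + \nu^2 q^2)$, and the usual matrix elements $Q(N, N-1) = Q(N-1, N) = \sqrt{N/2\nu}$. The function names are hypothetical.

```python
import math

def matmul(X, Y):
    """Product of two square matrices given as nested lists."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def oscillator_matrices(nu, dim):
    """Truncated q and p for one oscillator of frequency nu (assumed hbar = 1).

    Only the components N <-> N - 1 are non-zero, so each operator
    changes the photon number N by exactly one unit.
    """
    q = [[0j] * dim for _ in range(dim)]
    p = [[0j] * dim for _ in range(dim)]
    for N in range(1, dim):
        amp = math.sqrt(N / (2.0 * nu))
        q[N][N - 1] = q[N - 1][N] = amp
        p[N][N - 1] = 1j * nu * amp    # chosen so that i(pq - qp) = 1
        p[N - 1][N] = -1j * nu * amp
    return q, p

def energy(nu, dim):
    """Matrix of (p^2 + nu^2 q^2) / 2 in the truncated space."""
    q, p = oscillator_matrices(nu, dim)
    qq, pp = matmul(q, q), matmul(p, p)
    return [[0.5 * (pp[i][j] + nu * nu * qq[i][j]) for j in range(dim)]
            for i in range(dim)]
```

Under these assumptions the matrix returned by `energy` is diagonal, with entries $\nu(N + \tfrac12)$ for every $N$ below the truncation edge; the constant $\nu/2$ is the null-point energy that the text discards.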

The $\psi$ appear in the part of the energy due to matter only
in combinations of the form $\tilde\psi_\rho \psi_\sigma$. Consequently it will be found
advantageous in dealing with electrons to apply the method
of composition followed by anti-symmetric reduction ; we have
shown in the preceding section that this procedure is equivalent
to quantizing in accordance with the rules (12.4'). Since the
electro-magnetic quantities commute with the $\psi_\rho$, $\tilde\psi_\rho$ they may
here be considered as ordinary numbers. The quantized wave
equations then refer to a “vector” $\mathfrak{z}$ with components

Zp, . . . P„(^l ' ' ■ Pn\ qx), 

where $P_1, \ldots, P_n$ are the positions of the $n$ electrons and
$\rho_1, \ldots, \rho_n$ their spin variables, each of which runs through
the four values 1, 2, 3, 4. We write $z_{\rho_1 \cdots \rho_n}$ as a column
consisting of $4^n$ terms ; this $z$ is anti-symmetric with respect
to a permutation affecting the $P_r$ and $\rho_r$ alike.
$\mathfrak{S}^{(r)} = (S_1^{(r)}, S_2^{(r)}, S_3^{(r)})$ is the spin vector $(S_1, S_2, S_3)$ operating only on the
index $\rho_r$, $T^{(r)}$ is similarly the operation $T$ on the index $\rho_r$
which interchanges $\psi_1, \psi_2$ with $\psi_3, \psi_4$, and $\operatorname{grad}^{(r)}$ is the gradient
with respect to $P_r$. The part of the Hermitian energy operator
— in the equation 

which depends only on matter is 

f (©<'>, i gracU> + VaXe • UP,) + ) ^ grad UPr) ' ~) 

+ (12.9) 

r = l 

and to this must be added the electro-magnetic part (12.8). 

Since we have throughout taken the scalar potential $f_0 = 0$
we have lost the equation

$$\operatorname{div}\mathfrak{E} + \rho = 0 \tag{12.10}$$

arising from the variation of $f_0$. This equation contains no
derivatives with respect to time, and consequently represents
a condition on the state of the field at a moment $t = \text{const.}$ ;
we must naturally take it into account. On substituting the
value of $\mathfrak{E}$ from (12.7) we obtain

$$\sum_\lambda q_\lambda\,\Delta\varphi_\lambda + \rho = 0$$

and on multiplying with $\varphi_\lambda$ and integrating over the space under
consideration

$$q_\lambda - \int \rho\,\varphi_\lambda\,dV = 0.$$

From the standpoint of quantum mechanics the left-hand side
of this equation is an operator $D_\lambda$, and the meaning of the
equation $D_\lambda = 0$ is that only those vectors $\mathfrak{z}$ which satisfy the
equation $D_\lambda \mathfrak{z} = 0$ are to be allowed. $D_\lambda$ also consists of an
electrical part $q_\lambda$ and a material part

^p^xdV = + ^ 3^3 + 

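The operator $D_\lambda$ which is to be applied to $\mathfrak{z}$ is accordingly

$$D_\lambda = q_\lambda - \sum_{r=1}^{n} \varphi_\lambda(P_r).$$

The equations $D_\lambda \mathfrak{z} = 0$ then assert that all components
$z(P_r ; N_\nu ; q_\lambda)$ of $\mathfrak{z}$ vanish except those for which
$q_\lambda = \sum_{r=1}^{n} \varphi_\lambda(P_r)$ ; we may therefore write the
non-vanishing components as

$$\psi(P_r ; N_\nu) = z\Big[P_r ; N_\nu ; \sum_{r=1}^{n} \varphi_\lambda(P_r)\Big].$$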

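But then

$$\operatorname{grad}^{(r)}\psi = \operatorname{grad}^{(r)} z + \sum_\lambda \operatorname{grad}\varphi_\lambda(P_r)\cdot\frac{\partial z}{\partial q_\lambda}$$

is exactly the combination which appears in (12.9). $\sum_\lambda q_\lambda^2$ is
now given by

$$\sum_{r,s=1}^{n}\sum_\lambda \varphi_\lambda(P_r)\,\varphi_\lambda(P_s) = \sum_{r,s=1}^{n} G(P_r, P_s)$$

where

$$G(P, P') = \sum_\lambda \varphi_\lambda(P)\,\varphi_\lambda(P')$$

is the ordinary Green’s function for the cavity. We consequently
obtain the quantum equation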


1 

i dxQ 

for i/f, in which the operator 

” fl 




= grad(^>) -f thoDA + 5 f G(P„ P,) 

r = lli j Zr.« = l 

+ E^K + VaZ UPr)) . Qr). (12.11) 

V p r — 1 


In Dirac's theory

$$\frac{1}{i}(\mathfrak{S}, \operatorname{grad}) + m_0 T$$

is the energy operator for a single free particle, $\alpha\,G(P, P')$ is
the classical potential due to the electro-static repulsion between
two electrons situated at $P$ and $P'$. The next term
represents the sum of the energies $\nu$ of the photons in the various
frequency states $\nu$, and finally the last term represents the
interaction between photons and electrons by emission and
absorption. The meaning of each of the terms from which
the energy operator (12.11) is constructed is thus apparent.
The quantum theory had previously dealt with fields, such as 
that which binds the electron in hydrogen to the nucleus, in 
a manner entirely different from that with which it treated the 
field of the emitted radiation ; the first was calculated classically 
and purely electro-statically as an action at a distance described 
by the Coulomb potential, whereas the second was broken up 
into discrete photons with the aid of Bohr's frequency condition. 
We have now obtained a theoretical justification for this pro- 
cedure which led to good agreement with experiment. 
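
The representation of the Green's function as a sum over characteristic functions, $G(P, P') = \sum_\lambda \varphi_\lambda(P)\varphi_\lambda(P')$, can be tried out in a one-dimensional analogue. The sketch below is only an illustration under assumptions not made in the text: the “cavity” is the interval $[0, 1]$, the characteristic functions are $u_n(x) = \sqrt{2}\sin n\pi x$ with characteristic values $(n\pi)^2$, and $\varphi_n = u_n / (n\pi)$. The function names are hypothetical.

```python
import math

def green_series(x, y, terms=2000):
    """Partial sum of phi_n(x) * phi_n(y) over the first `terms` modes.

    Assumed one-dimensional analogue: phi_n(x) = sqrt(2) sin(n pi x) / (n pi),
    i.e. characteristic functions vanishing at the "walls" x = 0 and x = 1.
    """
    total = 0.0
    for n in range(1, terms + 1):
        k = n * math.pi
        total += 2.0 * math.sin(k * x) * math.sin(k * y) / (k * k)
    return total

def green_closed(x, y):
    """Closed form of the Green's function of -d^2/dx^2 on [0, 1] with zero boundary values."""
    return min(x, y) * (1.0 - max(x, y))
```

In this analogue the series and the closed form agree to within the truncation error of the sum. Unlike the three-dimensional case discussed in the text, the one-dimensional function stays finite when the two points coincide, so the sketch illustrates only the mode expansion and not the divergence of $G(P, P)$.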

Our expression shares with classical electro-dynamics the
disadvantage that it contains the term $G(P_r, P_r)$ representing
the infinitely large reaction of the electron with itself, for
as we allow $P'$ to approach $P$, $G(P, P')$ becomes infinite like the
reciprocal of the distance $PP'$. We should therefore replace
$G(P, P)$ by the finite $\Gamma(P, P)$ where

$$\Gamma(P, P') = G(P, P') - \frac{1}{4\pi \cdot PP'}$$

for this amounts to dropping an infinitely large additive constant
from $J_0$. $\Gamma(P, P)$ represents the effect on an electron at
$P$ of the field obtained by reflecting the field of $P$ in the walls
of the cavity. (12.11) shows explicitly how the various terms
of $J_0$ depend on the value of the fine-structure constant $\alpha$ ; on
developing the solution in powers of $\alpha$ we are faced again and
again with infinitely large terms of the same kind as $G(P_r, P_r)$.
The operator $J_0$ contains singularities which, at the present
stage, frustrate all attempts to carry through the theory. We
may indeed conclude with P. Jordan that the problem of the
existence of the electron is solved, but that that of its
constitution has as yet eluded us. Our equations further suffer
from the fundamental disadvantage of the Dirac theory that
the individual spin variables assume 4 instead of 2 different
values.

There is, of course, nothing to prevent us from quantizing
the matter waves in a manner analogous to that applied to
electro-magnetic waves. We should then develop our quantities
describing the material field in a series of characteristic
functions $\psi = \psi_\mu$ (with four components) of the Dirac equation

(©, grad) + WoT|t/r + W* = 0 ( 12 . 12 ) 

which constitute, on imposing appropriate boundary conditions,
a complete orthogonal system. The general component $z$ of
the vector $\mathfrak{z}$, on which the energy $-J_0$ operates, will then depend
on the quantum numbers $n_\mu$ which correspond to the characteristic
values $\mu$ and which may assume only the values 0 and
1, and in addition on the numbers $N_\nu$ of photons of the various
frequencies $\nu$ and on the continuous variables $q_\lambda$. But then the
operators $D_\lambda$, which commute among themselves and with $J_0$,
are not in diagonal form, and the elimination of $q_\lambda$ cannot be
accomplished as in the above method.

Instead of introducing a cavity as in the above we may
employ a rectangular parallelepipedon with the “boundary
condition” that all functions are to be periodic functions whose
periods are the lengths of the sides of the parallelepipedon.
We can then introduce running instead of standing waves as
characteristic functions for the electro-magnetic field ; this gives
rise to a better agreement with the physical picture in which
a photon corresponds to a homogeneous plane wave. The
energy and the momenta are then also in diagonal form if we
neglect the interaction between matter and light. Equation
(12.10) then causes some difficulty, as its right-hand side 0 
must be replaced by the constant mean value of the charge 
throughout the entire space in order that a periodic solution 
be possible. On taking account of protons in the theory this 
will automatically correct itself, as the total charge will then 
be 0. 

The dynamical law allows only those quantum jumps of the
particles in which one $n_\mu$ falls from 1 to 0 and another $n_{\mu'}$ jumps
at the same time from 0 to 1. Consequently the total number
of particles $\sum_\mu n_\mu$, and therefore the charge, remains fixed ; hence
that portion of the dynamical laws in which the total number
is a given finite $n$ is separated off from the remaining portion
and intercombinations between the two do not arise. Dirac
has proposed to interpret the presence or the absence of a proton
in the state of positive energy $\mu$ as the absence or the presence,
respectively, of an electron in the corresponding negative energy
state $-\mu$ ; our laws will then include protons as well as electrons.
Remembering that the numbers $n_\mu = 0, 1$ were at first introduced
merely as an arbitrary index indicating the rows of a
matrix, there is nothing to prevent us from replacing the numbers
$n_{-\mu}$ for negative $-\mu$ by $n'_\mu = 1 - n_{-\mu}$, keeping $n_\mu$ for
positive $\mu$. The theorem of the conservation of charge is then

$$\sum_\mu (n_\mu - n'_\mu) = \text{const.} \qquad (\mu > 0).$$

But we thereby alter the content, as well as the notation, of
the theory ; we are now interested in that part of the dynamical
equations in which only a finite number of $n_\mu$ with positive $\mu$
are different from 0 and only a finite number of $n_\mu$ with negative
$\mu$ are different from 1 ! The quantum jump of an electron
between positive and negative energy levels, which was so
undesirable in the Dirac theory as formulated in the previous
section, now appears as a process in which an electron and a
proton are simultaneously destroyed and as the inverse process.
The assumption of such an occurrence, for which our terrestrial
experiments offer no justification, has long been entertained in
astrophysics, as it seems otherwise extremely difficult to explain
the source of the energy emitted by stars.

However attractive this idea may seem at first, it is certainly 
impossible to hold without introducing other profound modi- 
fications to square our theory with the observed facts. Indeed, 
according to it the mass of a proton should be the same as the 
mass of an electron ; furthermore, no matter how the action 
is chosen (so long as it is invariant under interchange of right 
and left), this hypothesis leads to the essential equivalence of 
positive and negative electricity under all circumstances — even 
on taking the interaction between matter and radiation rigor- 
ously into account. 

Having now quantized the field equations, we must return
to the question of how the constituents $M$, $M'$, $F$ of the action
behave under the substitutions (6.12), (6.13), (6.14). The first
two substitutions, which we may call (a) and (b), have exactly
the same effect as before. But the third substitution (c),
which sends the components of $\psi$ over into the components
of $\tilde\psi$ or their negative, now affects $M$ and $M'$ differently, for
$\psi$ and $\tilde\psi$ are no longer commutative with respect to multiplication ;
they are, in fact, almost anti-commutative. From this
it is found that $M$, $M'$, $F$ behave under (c) in exactly the same
way as they do under (b), i.e. they are multiplied by the signs
$-$, $-$, $+$ respectively. Hence past and future play essentially
different roles in the quantized field equations ; we find no
substitution which leaves these equations unchanged while reversing
the direction of time. It seems to me that we have thereby
reached an extraordinarily important goal of physics. We
can now obtain the substitution

(« = 0, 1, 2, 3) I 

on combining (a), (b) and (c) ; this substitution neither affects
the co-ordinates nor disturbs the quantized wave equations. 
In view of Dirac’s theory of the proton this means that positive 
and negative electricity have essentially the same properties 
in the sense that the laws governing them are invariant under 
a certain substitution which interchanges the quantum numbers 
of the electrons with those of the protons. The dissimilarity 
of the two kinds of electricity thus seems to hide a secret of 
Nature which lies yet deeper than the dissimilarity of past and 
future. 

§ 13. The Energy and Momentum Laws of Quantum 
Physics. Relativistic Invariance 

In quantizing the wave equations the spatial and temporal
variables were treated so differently that the relativistic
invariance of the resulting laws might seem to be open to serious
doubt. But a thorough investigation due to Heisenberg and
Pauli reassures us on this point. We carry through these
considerations on our action principle, but in such a way that
the general validity of the argument may be readily seen. At
the same time this offers an opportunity to discuss the meaning
of the quantization more thoroughly than we have done hitherto.

I. The Energy and Momentum Laws of Quantum Physics.

We begin with the 4 + 3 + 3 operators $\psi_\rho$, $f_p$, $E_p$, which
are functions in 3-dimensional space satisfying the commutation
rules (12.4) and the supplementary rules there set forth. There
exists one, and in the sense of equivalence only one, irreducible 
solution of these conditions. From it we obtain the energy 
density defined by (6.5), (6.6) and integrate it over all of 
space : 

We next construct the “ commutator ” 

% 


(13.1) 
of an arbitrary operator $\Theta$ with $J_0$. Consider the result of this
for the particular operators $\Theta = \psi_\rho, f_p, E_p$ ; it should be possible
to evaluate these commutators using (12.4) and the supplementary
rules alone ; if one of the quantities involved appears as a
derivative with respect to a spatial co-ordinate it should be
transformed by integrating (13.1) by parts, or by deducing
commutation rules for it from (12.4) in terms of appropriately
defined derivatives of the $\delta$ function. If $\partial/\partial x_0$ is that process
involving only differentiations with respect to the spatial variables,
but which coincides with the derivative with respect to
time in virtue of the Maxwell-Dirac field equations, we find

8 /', = ^. (13.2) 

We now drop the normalization $f_0 = 0$. It follows from these
equations that $\delta\Theta$ for any gauge invariant operator $\Theta$ coincides
with its time derivative as defined in terms of its spatial derivatives
by means of the field laws. We may therefore replace
the Maxwell-Dirac field equations by the quantum mechanical
dynamical law

7 , 

$\mathfrak{z}$ represents the probability state of the physical system (pure
state !) at the time $x_0$ ; it is a vector of that vector-space in which
our operations take place. The fundamental concepts here
involved are contained in the general programme of quantum
mechanics as set forth in II, § 7. The “density of electricity
at the point $P$” is, for example, represented by the operator
$\rho = \tilde\psi_1\psi_1 + \tilde\psi_2\psi_2 + \tilde\psi_3\psi_3 + \tilde\psi_4\psi_4$ which is independent of time. The changes

in the probability distribution for this physical quantity in
course of time are due to the changes in the state $\mathfrak{z}$ and not to
changes in $\rho$ itself ; the rule for the calculation of this probability
distribution from $\rho$ and $\mathfrak{z}$ is given in the general programme
referred to above. The same remarks apply to any gauge
invariant quantity $\Theta$. However, it is more desirable to consider
the “density of electricity” (without specifying either
time or position) as a fixed physical quantity represented by a
definite operator $\rho$, and to ascribe the variations in its probability
distribution in time and space to changes in the probability
state $\mathfrak{z}$ considered as a function of the spatial co-ordinates
$x_1, x_2, x_3$ in addition to the time $x_0$. We should then expect to
find four equations 

= 8 (a = 0, 1, 2, 3) (13.4) 

in place of the one (13.3) in which the operators 

7.- ItUV 

are those representing energy and momentum. Only now that
we have formulated the general scheme of quantum physics
in a manner which is symmetric with respect to the spatial
and temporal co-ordinates, as required by the theory of relativity,
can we consider it as complete. In order to determine the
mean value of a quantity such as the electric density $\rho$ we must
assign to the spatial co-ordinates $x_1, x_2, x_3$, on which the operator
$\rho$ depends, any definite values $x_p$ (e.g. 0). The spatial components
of equation (13.4) tell us that the replacement of $(x_p)$
by a neighbouring point $(x_p + dx_p)$ amounts to the same thing
as subjecting the normal co-ordinate system in system space,
to which the vectors $\mathfrak{z}$ are referred, to the infinitesimal rotation

$$i(J_1\,dx_1 + J_2\,dx_2 + J_3\,dx_3).$$

We must not forget that the equation (13.3) is not equivalent
to the complete set of field equations, for we have omitted the
one

$$\sigma(P) \equiv \operatorname{div}\mathfrak{E} + \rho = 0$$

which does not involve differentiation with respect to time. We
must therefore restrict ourselves to vectors $\mathfrak{z}$ which satisfy all
the equations

$$\sigma(P)\,\mathfrak{z} = 0. \tag{13.5}$$

These equations define a linear sub-space $\mathfrak{R}_\sigma$ of the original
system-space $\mathfrak{R}$. The operators $\sigma(P)$, $\sigma(P')$ associated with any
two points $P$, $P'$ of space are commutative :

$$\sigma(P)\,\sigma(P') - \sigma(P')\,\sigma(P) = 0.$$

It is of prime importance that $\sigma(P)$ commute with $J_0$, i.e. that

$$\delta\sigma = i(J_0\sigma - \sigma J_0) = 0 ;$$

that this is the case follows from the fact that the equation
$\partial\sigma/\partial x_0 = 0$ is a consequence of the remaining field equations in
the classical field theory, and consequently (independently of
our field equations) we may conclude that the gauge invariant
operator $\sigma$ satisfies the equation $\delta\sigma = 0$. This commutativity
of $\sigma(P)$ and $J_0$ guarantees that the infinitesimal rotation
of system-space during the time interval $dx_0$ does not carry the
vector $\mathfrak{z}$ lying in the sub-space $\mathfrak{R}_\sigma$ out of $\mathfrak{R}_\sigma$.

Continuing our programme, we now set 

and investigate the “commutator”

$$\delta\Theta = [J_1, \Theta]$$

of an operator $\Theta$ with $J_1$ ; we shall denote this commutator by
$\delta_1$ whenever confusion might arise between it and the commutator
$\delta = \delta_0$ with $J_0$. We find the equations *

¥p = ; S/i = 0, 8A = //a 

8E, 


8Ei 


/ ^^2 
\8a::2 


4- 

8 X 3 


+ P, 




8 X 2 ’ 
8/42 . 

8;Vi’ ’ 


SE, 


(13.6) 

From this it follows that for any gauge invariant quantity $\Theta$
we have $\delta\Theta = \partial\Theta/\partial x_1$ on taking the equation $\sigma = 0$ into account.
Hence the way in which gauge invariant quantities depend on
the spatial co-ordinates can in fact be described as we predicted :
the operators representing them are constant, but the vector
$\mathfrak{z}$ representing the probability state varies in space in accordance
with the equations (13.4) for $\alpha = 1, 2, 3$.

That the four equations (13.4) are consistent also follows
from these considerations. In the first place we have

$$\delta_1\sigma = 0 \quad\text{or}\quad \sigma(P)\,J_1 - J_1\,\sigma(P) = 0$$

in the entire space $\mathfrak{R}$ ; this follows from (13.6). In the classical
field theory the differential conservation theorem




+ u.. + 


0 i;a2 ^^3 


4 - 

4)44 'dX. 




is a consequence of the field equations. Since is a gauge 
invariant, it follows that after the quantization the operators 
satisfy the relation 


+ ( 


8a:i 8a;2 8X3 


) 


0 


♦ In contrast with (6.2) we now employ the letter Jp, without the factor 
1 /a, as an abbreviation for curl f. 
in the space $\mathfrak{R}_\sigma$ defined by (13.5). Integrating over the space
$x_0 = \text{const.}$ we obtain

= 0 or ^ 0^1 (13.7) 

[The equation which takes the place of (13.7) for the entire 
space 91 is 

Furthermore, 


in 9f,„ and on integrating this over space we find 

= 0 or = 0. 

We thus see that the operators $J_\alpha$ are commutative in $\mathfrak{R}_\sigma$ and
consequently equations (13.4) possess one and only one solution
$\mathfrak{z}$ when the initial value of $\mathfrak{z}$ (i.e. at the origin of the space-time
co-ordinate system) is a given vector in $\mathfrak{R}_\sigma$.

II. Relativistic Invariance.

On transforming from the normal co-ordinate system $x_\alpha$ in
space-time to another $x'_\alpha$ by means of a Lorentz transformation

3 

the solution of the equations 

= (13.4') 

is, as we shall show, obtained from the solution of (13.4) by 
means of a unitary transformation U induced in system-space 
by A. That is, there exists a unitary transformation U such 
that 

- = {zyjx'MVi) 

Z A 

is satisfied in virtue of (13.4) : 

IJ- Ey.dx^=- Z7^dx-,-U 

A • 

or 

uy^=^ Zop.7r u. (13.8) 
We could also say that (13.4') have the same solution $\mathfrak{z}$ as (13.4)
but that the normal co-ordinate system employed in system-space
has undergone the unitary rotation $U$, for the vector $U\mathfrak{z}$
has the same components with respect to the new co-ordinate
system as $\mathfrak{z}$ had with respect to the old. We are only able to
give the transformation $U$ explicitly for infinitesimal A :

IM = 1 + ; f/=l + i8M. 

The equations (13.8) which are to be verified are then 

= [ 8 M, yj. 

fl 

In particular, the operators in system-space which correspond
to infinitesimal rotations in physical space are, as we have
long known, those representing moment of momentum ; that
$\delta M$ corresponding to the infinitesimal rotation

$$\delta x_0 = 0, \quad \delta x_1 = 0, \quad \delta x_2 = -x_3, \quad \delta x_3 = x_2 \tag{13.9}$$

about the $x_1$-axis is the $x_1$-component of moment of momentum ;

{M, - )M.23 = f(X 2 /^ - x4)dV. (13.10) 

The infinitesimal Lorentz transformations which actually represent
a re-partitioning of the world into a new space and a new
time are dealt with in exactly the same manner ; it will suffice
to consider as typical of such transformations

$$\delta x_0 = x_1, \quad \delta x_1 = x_0, \quad \delta x_2 = 0, \quad \delta x_3 = 0.$$

The $\delta M$ associated with this transformation is


Mio = \xilldV + Sx,i\dV-. 

the second term, which vanishes for $x_0 = 0$, can be omitted,
for we have already shown that $J_1$ commutes with all $J_\alpha$. This
term does not fit into the present scheme, in which all the
operators are functions of $x_1, x_2, x_3$ alone. Our problem is thus
reduced to showing that in $\mathfrak{R}_\sigma$


$$[M_{23}, J_\alpha] = 0, \quad 0, \quad J_3, \quad -J_2, \tag{13.11}$$

$$[M_{10}, J_\alpha] = -J_1, \quad -J_0, \quad 0, \quad 0 \tag{13.12}$$

for $\alpha = 0, 1, 2, 3$.


Furthermore, the invariance of equations (13.5) which define
the sub-space $\mathfrak{R}_\sigma$ will be proved by showing that the equations

$$[M_{23}, \sigma] = 0, \qquad [M_{10}, \sigma] = 0 \tag{13.13}$$

hold in the entire space $\mathfrak{R}$.
In order to prove (13.11) we make use of the identities

= 0 [a=l, 2, 3]. 

Introducing the Kronecker 8 ,t, the integrand may be written 



In consequence of ct — 0 and since t — l?^, are gauge invariants 
the operations 

may be replaced by 8 ^/ = [J^, t], 

whence 

( 8.2 % - 8„3 7,) I- 8 J {X, tl - X, ttidV = 0 
or 

KM,, = [J., Kz 7^ - 8.2 7, [a - 1, 2, 3J. 

In the classical field theory the conservation law 

2^ <^(^2 ^3 X, (,) Q 

« = 0 7>x„ 

is a consequence of the field equations, whence on quantizing 
8 o(;r 2 - ;r, /«) + i = 0 

a = 1 

holds identically in Integrating over the whole of physical 

space we obtain 

$$\delta_0 M_{23} = [J_0, M_{23}] = 0 ;$$

equations (13.11), i.e.

$$[M_{23}, J_\alpha] = \delta_{\alpha 2} J_3 - \delta_{\alpha 3} J_2 \qquad [\alpha = 0, 1, 2, 3],$$

are thus completely verified.

The relations (13.12) are obtained in an analogous manner 
from 

[fora-1, 2, 3] 

and from the equation 

f{8o(;ru2) + - 0 
which parallels the conservation theorem


1 ^(^1 ^0 “F ^0 ^ l ) 


'dXn 


dV = 0 


of the classical field theory. 

We should expect the operator functions expressed by the
$\psi_\rho$, $f_p$, $E_p$, depending on the spatial co-ordinates, to be invariant
if we associate with an infinitesimal rotation of the
spatial co-ordinate system an appropriate linear transformation
of the components $\psi_\rho$ among themselves and of the vector
components $f_p$, $E_p$, and at the same time subject the normal
co-ordinate system in system space to the corresponding
unitary transformation. In formulae : We expect the process

$$\delta\Theta = [M_{23}, \Theta]$$

to yield the equations

H 

S/p = S'/p 4- (Spj/s — Spj/j), 

8Ep = h'Ep -b (8p2 E^ — 8p3 £2), 


where we have written 



'bxi 


But we find by direct calculation that 
8i/» = h'ljf + i{x.j3 — 

S/i = ^2 H2 d” ^3 f^ 3 i S/2 = X2 Hi, 8/3 = x^ Hi, 

8£p h'Ep 4- hpj{E3 4- .tj a) — hpsiEj 4- x^a). 

We first observe that these equations yield

$$\delta\sigma = [M_{23}, \sigma] = 0$$

independently of the condition $\sigma = 0$. On introducing the
condition $\sigma = 0$ we find from these equations that gauge invariant
quantities $\Theta$ exhibit the expected behaviour. The
second of the equations (13.13) can be obtained by an analogous
computation.
D. Quantum Kinematics

§ 14. Quantum Kinematics as an Abelian Group of Rotations

If we consider the operators $ip$, $iq$ as infinitesimal unitary
rotations of the ray field in system space, then Heisenberg’s
commutation rules [II, (11.4)] assert that these rotations are
commutative ; consequently they generate a $2f$-parameter
Abelian group, where $f$ is the number of degrees of freedom.
Let us therefore investigate the properties of Abelian groups
of unitary rotations in the ray field of $n$-dimensional space !
On introducing a gauge as in III, § 16, to each such “rotation”
there corresponds a transformation of vector space with matrix
$A$ and between any two matrices $A$, $B$ there exists an equation
of the form

$$AB = \varepsilon BA. \tag{14.1}$$


This equation is possible only if $\varepsilon$ is an $n$th root of unity, for on
evaluating the determinant of both sides we obtain $\varepsilon^n = 1$.
From (14.1) we obtain by mathematical induction

$$A^k B = \varepsilon^k B A^k, \qquad A B^l = \varepsilon^l B^l A \tag{14.2}$$

for $k, l = 1, 2, 3, \ldots$. On combining these two equations by
applying the second to $A^k$ and $B$ instead of $A$ and $B$ we find
the general rule

$$A^k B^l = \varepsilon^{kl} B^l A^k. \tag{14.3}$$

Taking $k = n$ in (14.2) we are led to the equation $A^n B = B A^n$ ;
if the Abelian rotation group is irreducible Schur’s fundamental
lemma allows us to conclude that since $A^n$ commutes with all
elements $B$ of the group it must be a multiple of the unit matrix :
$A^n \sim 1$. The order of any element of an irreducible Abelian
rotation group in $n$ dimensions is consequently a factor of $n$.
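
The relations (14.1) to (14.3) can be checked numerically on a concrete pair of matrices. The sketch below uses the $n$-dimensional “clock” and “shift” matrices, a standard example that is assumed here for illustration and is not taken from the text; the helper names are hypothetical.

```python
import cmath

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def matmul(X, Y):
    """Product of two square matrices given as nested lists."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def matpow(X, k):
    """k-th power of X by repeated multiplication (k >= 0)."""
    R = identity(len(X))
    for _ in range(k):
        R = matmul(R, X)
    return R

def scale(c, X):
    return [[c * x for x in row] for row in X]

def close(X, Y, tol=1e-9):
    return all(abs(x - y) < tol for rx, ry in zip(X, Y) for x, y in zip(rx, ry))

def clock_and_shift(n):
    """Return (eps, A, B) with A = diag(1, eps, ..., eps^(n-1)) and B the cyclic shift.

    eps is a primitive n-th root of unity; the pair satisfies AB = eps * BA.
    """
    eps = cmath.exp(2j * cmath.pi / n)
    A = [[eps ** i if i == j else 0 for j in range(n)] for i in range(n)]
    B = [[1 if i == (j + 1) % n else 0 for j in range(n)] for i in range(n)]
    return eps, A, B

def check_rule(n, k, l):
    """True if A^k B^l = eps^(k l) B^l A^k holds for the clock/shift pair."""
    eps, A, B = clock_and_shift(n)
    Ak, Bl = matpow(A, k), matpow(B, l)
    return close(matmul(Ak, Bl), scale(eps ** (k * l), matmul(Bl, Ak)))
```

For this pair `check_rule(n, 1, 1)` is the relation (14.1), general `k`, `l` give (14.3), and `matpow(A, n)` reproduces the unit matrix, in line with the statement that the order of each element is a factor of $n$.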

An $f$-parameter continuous rotation group is generated by
an $f$-dimensional linear family $\mathfrak{g}$ of infinitesimal unitary
correspondences

$$\sigma_1 C_1 + \sigma_2 C_2 + \cdots + \sigma_f C_f \tag{14.4}$$

in terms of a basis formed by any $f$ independent elements
$C_1, C_2, \ldots, C_f$ of the family. The numerical parameters
$\sigma_1, \sigma_2, \ldots, \sigma_f$ may assume all real values. Setting $\sigma_i = a_i\,d\tau$
and reiterating the infinitesimal transformation (14.4), we find
that at “time” $\tau$ the resulting transformation is

$$U(\sigma) = \exp(\sigma_1 C_1 + \sigma_2 C_2 + \cdots + \sigma_f C_f) \tag{14.5}$$

where we have replaced $a_i \tau$ by $\sigma_i$. $U$ runs through the entire
group, which is now expressed in terms of the parameters $\sigma$.
If the group of unitary transformations of the vector space is
Abelian the $C_\mu$ must satisfy the conditions

$$C_\mu C_\nu - C_\nu C_\mu = 0. \tag{14.6}$$

From this it then follows that all the elements (14.5) of the
group are mutually commutative, for if $AB - BA = 0$ we have,
as in the domain of ordinary numbers,

$$\exp(A)\,\exp(B) = \exp(A + B).$$


The parameters $\sigma$ in (14.5) are added on composition :

$$U(\sigma_1, \ldots, \sigma_f)\,U(\sigma'_1, \ldots, \sigma'_f) = U(\sigma_1 + \sigma'_1, \ldots, \sigma_f + \sigma'_f).$$

If, however, only the rotations of the ray space are commutative,
we find in place of (14.6) conditions of the form

$$C_\mu C_\nu - C_\nu C_\mu = i\,c_{\mu\nu} \cdot 1$$

where the $c_{\mu\nu}$ constitute an anti-symmetric system of real numbers.
The commutator of the infinitesimal transformations with
matrices

$$A = \sigma_1 C_1 + \cdots + \sigma_f C_f, \qquad B = \tau_1 C_1 + \cdots + \tau_f C_f$$

is

$$AB - BA = i \sum_{\mu,\nu} c_{\mu\nu}\,\sigma_\mu \tau_\nu \cdot 1.$$

We shall refer to the anti-symmetric form

$$\sum_{\mu,\nu} c_{\mu\nu}\,\sigma_\mu \tau_\nu = h(\sigma, \tau)$$

as the commutator form ; it is invariant under change of basis.
On writing $1 + \frac{A}{m}$, $1 + \frac{B}{m}$ in (14.3) in place of $A$, $B$ and allowing
$k = l = m \to \infty$, we find that the commutator of any two
elements $U(\sigma_1, \sigma_2, \ldots, \sigma_f) = U(\sigma)$ and $U(\tau)$ of the group is

$$U(\sigma)\,U(\tau)\,U^{-1}(\sigma)\,U^{-1}(\tau) = e[h(\sigma, \tau)] \cdot 1. \tag{14.7}$$

If the rotation group is irreducible a fixed $U(\sigma)$ can only
commute with all $U(\tau)$ if it is a multiple of the unit matrix,
i.e. if all its parameters $\sigma$ vanish. From this we conclude that
the commutator form is non-degenerate, i.e. that it cannot
vanish identically in $\tau_\nu$ for a fixed set of values $\sigma_\mu$ unless all
$\sigma_\mu = 0$ ; this amounts to the same as the condition $\det|c_{\mu\nu}| \neq 0$.
Such a form exists only if the number $f$ of variables is even, in
which case it can, by appropriate choice of the basis (i.e. by
transforming the variables $\sigma_\mu$ and $\tau_\mu$ cogrediently under an
appropriate transformation), be reduced to the canonical form
in which the matrix $\|c_{\mu\nu}\|$ is decomposed into 2-rowed sub-matrices

$$\begin{Vmatrix} 0 & 1 \\ -1 & 0 \end{Vmatrix}$$

arranged along the principal diagonal.* It is then desirable to
write $2f$ in place of $f$ and to denote the “canonical basis” so
obtained by

$$iP_\nu, \quad iQ_\nu \qquad (\nu = 1, 2, \ldots, f)$$

and the corresponding parameters by $\sigma_\nu$, $\tau_\nu$. The factor $i$ has
been introduced in order to express the results in terms of
Hermitian operators $P_\nu$, $Q_\nu$. The basic elements then satisfy
the commutation rules

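$$i(P_\nu Q_\nu - Q_\nu P_\nu) = 1, \qquad i(P_\mu Q_\nu - Q_\nu P_\mu) = 0$$

for $\mu \neq \nu$ and

$$P_\mu P_\nu - P_\nu P_\mu = 0, \qquad Q_\mu Q_\nu - Q_\nu Q_\mu = 0$$

for all $\mu$, $\nu$. The elements

$$U(\sigma) = e(\sigma_1 P_1 + \sigma_2 P_2 + \cdots + \sigma_f P_f) \qquad [e(x) = e^{ix}]$$

then constitute an $f$-parameter Abelian group of unitary (vector)
correspondences, as do also the

$$V(\tau) = e(\tau_1 Q_1 + \tau_2 Q_2 + \cdots + \tau_f Q_f).$$

But the commutator of elements $U(\sigma)$, $V(\tau)$ belonging to these
two sets, respectively, is

$$U(\sigma)\,V(\tau)\,U^{-1}(\sigma)\,V^{-1}(\tau) = e(\sigma_1 \tau_1 + \cdots + \sigma_f \tau_f) \cdot 1.$$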

We have now carried our development to a point where we
can profitably return to the considerations of II, § 11. In
the case of a system with one degree of freedom in classical
mechanics any physical quantity associated with the system
is expressed mathematically as a function $f(p, q)$ of the canonical
variables $p$, $q$. In making the transition to quantum mechanics
we had previously restricted ourselves to polynomials in $p$, $q$.
But the Fourier representation

$$f(p, q) = \int\!\!\int_{-\infty}^{+\infty} e(\sigma p + \tau q)\,\xi(\sigma, \tau)\,d\sigma\,d\tau \tag{14.8}$$

of a function $f$ is applicable to a much larger class of functions ;
this integral need not be interpreted literally, the essential
point being that it represents a linear combination of the simple
functions $e(\sigma p + \tau q)$. On considering $ip$, $iq$ as infinitesimal
unitary correspondences in ray space which are commutative
in accordance with the relation

$$i(pq - qp) = 1, \tag{14.9}$$

$e(\sigma p + \tau q)$ runs through the group generated by them. If we
now consider $\xi(\sigma, \tau)$ as the components of an element in the
resulting group algebra, then (14.8) is its group matrix in the
representation obtained by associating with $(\sigma, \tau)$ the unitary
transformation $e(\sigma p + \tau q)$. This group matrix is Hermitian if
the element is real, i.e. if

$$\bar\xi(\sigma, \tau) = \xi(-\sigma, -\tau).$$

* See Appendix 3.

A quantity $f$ is consequently carried over from classical to
quantum mechanics in accordance with the rule : replace $p$ and
$q$ in the Fourier development (14.8) of $f$ by the Hermitian operators
representing them in quantum mechanics. In particular, the
derivatives of $f$ are represented by

$$f_p = i\int\!\!\int_{-\infty}^{+\infty} e(\sigma p + \tau q) \cdot \sigma\,\xi(\sigma, \tau)\,d\sigma\,d\tau,$$

$$f_q = i\int\!\!\int_{-\infty}^{+\infty} e(\sigma p + \tau q) \cdot \tau\,\xi(\sigma, \tau)\,d\sigma\,d\tau.$$

On letting $U(\tau)$ in (14.7) again be infinitesimal we find, with
the aid of the commutation rules (14.9), that

$$p \cdot e(\sigma p + \tau q) - e(\sigma p + \tau q) \cdot p = \tau \cdot e(\sigma p + \tau q),$$

$$q \cdot e(\sigma p + \tau q) - e(\sigma p + \tau q) \cdot q = -\sigma \cdot e(\sigma p + \tau q).$$

We therefore have in general

$$-i f_q = p \cdot f - f \cdot p, \qquad i f_p = q \cdot f - f \cdot q$$

as required in order that the Hamiltonian equations

$$\frac{dq}{dt} = H_p, \qquad \frac{dp}{dt} = -H_q$$

be equivalent to the quantum-theoretical equations of motion
for the vectors of system space.

We have thus found a very natural interpretation of quantum
kinematics as described by the commutation rules. The kinematical
structure of a physical system is expressed by an irreducible
Abelian group of unitary ray rotations in system space. The real
elements of the algebra of this group are the physical quantities of
the system ; the representation of the abstract group by rotations
of system space associates with each such quantity a definite
Hermitian form which “represents” it. If the group is continuous
this procedure automatically leads to Heisenberg's
formulation ; in particular, we have seen how the pairs of
canonical variables then result from the requirement of irreducibility,
whence the number of parameters in such an irreducible
Abelian group must be even.

If one of the canonical co-ordinates, say $q$, is a cyclical
co-ordinate with period $2\pi$, then all quantities of the physical
system are represented by periodic functions with period $2\pi$.
Consequently the only values assumed by the parameter $\tau$
associated with $q$ in (14.8) are multiples of $2\pi$ and the integral
is to be replaced by a sum. In such a case we are no longer
dealing with a continuous group, but with a mixed
(continuous-discrete) group.

Our general principle allows for the possibility that the
Abelian rotation group is entirely discontinuous, or that it
may even be a finite group. Thus we have discussed in III,
§ 16, a group of order 4 and an irreducible ray representation
of it in 2 dimensions. That such groups actually occur in
Nature is shown by the fact that the group we have just mentioned
characterizes the kinematics of the electron spin discussed in § 4.
It can be readily shown that this is the only irreducible
representation of this group, and that it is in fact the only
irreducible 2-dimensional group of unitary rotations in ray space.
These results emphasize the remarkable nature of this simplest
case. The quantization of the problem of several electrons
discussed in § 11 also falls within our general scheme. In dealing
with it we are interested in that Abelian group whose basic
elements, numbered by $\alpha = 1, 2, \cdots, 2f$, are all of order 2;
such a group consists of the totality of the $4^f$ different products
in which each basic element occurs with an exponent $k_\alpha$
($k_\alpha = 1$ or $0$).

The gauge can be so chosen that the corresponding unitary
matrices $p_\alpha$ in the irreducible ray representation in $2^f$
dimensions satisfy the equations

$$p_\alpha^2 = 1, \qquad p_\beta p_\alpha = -p_\alpha p_\beta \quad (\alpha \neq \beta). \qquad (14.10)$$

The kinematics of the spinning electron is described by the
simplest case $f = 1$ of this representation.
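
The relations (14.10) can be checked on explicit matrices. The following is only a sketch: it assumes one standard tensor-product construction from the Pauli matrices (not necessarily the form given at the end of § 11), and the helper names `kron`, `anticommuting_matrices` and `satisfies_14_10` are hypothetical.

```python
# Sketch only: an assumed standard construction, not taken from the text.
I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]


def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def kron(a, b):
    """Kronecker (tensor) product of two square matrices."""
    return [[a[i][j] * b[k][l] for j in range(len(a)) for l in range(len(b))]
            for i in range(len(a)) for k in range(len(b))]


def anticommuting_matrices(f):
    """Return 2f matrices of dimension 2**f intended to satisfy (14.10)."""
    mats = []
    for site in range(f):
        for pauli in (SX, SY):
            m = [[1]]
            # SZ factors in front supply the sign changes between sites.
            for factor in [SZ] * site + [pauli] + [I2] * (f - site - 1):
                m = kron(m, factor)
            mats.append(m)
    return mats


def satisfies_14_10(mats):
    """Check p_a^2 = 1 and p_b p_a = -p_a p_b for a != b."""
    n = len(mats[0])
    one = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    for a, pa in enumerate(mats):
        if matmul(pa, pa) != one:
            return False
        for b, pb in enumerate(mats):
            if a != b:
                ab, ba = matmul(pa, pb), matmul(pb, pa)
                if any(ab[i][j] != -ba[i][j]
                       for i in range(n) for j in range(n)):
                    return False
    return True
```

For $f = 1$ the two matrices are simply two of the Pauli matrices, in agreement with the remark that the spinning electron is the simplest case.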

Because of these results I feel certain that the general scheme 
of quantum kinematics formulated above is correct. But the 
field of discrete groups offers many possibilities which we have 
not as yet been able to realize in Nature ; perhaps these holes 
will be filled by applications to nuclear physics. However, it 
seems more probable that the scheme of quantum kinematics 
will share the fate of the general scheme of quantum mechanics : 
to be submerged in the concrete physical laws of the only existing 
physical structure, the actual world. 
§ 15. Derivation of the Wave Equation from the
Commutation Rules 

We now show by actual construction that there exists but 
one irreducible ray representation (excluding the identity) of 
a 2-parameter continuous Abelian group : namely, that one 
which leads to the wave equation. 

We obtain our 2-parameter continuous group as the limiting 
case of a finite group with 2 basic elements ; our proof is rigorous 
only insofar as the validity of this limiting process is admitted. 
Let $A$, $B$ be two commutative rotations of an $n$-dimensional
unitary space. On introducing the gauge we have an equation
between their matrices:

$$AB = \varepsilon BA, \qquad (14.1)$$

in which, as we know already, $\varepsilon$ is an $n$-th root of unity.
The system consisting of the two matrices $A$, $B$ shall be
irreducible. Let their commutator, the number $\varepsilon$, be a
primitive $m$-th root of unity, i.e. $\varepsilon^m$ is the lowest power
of $\varepsilon$ which is equal to 1; $m$ is then a factor of $n$. The
orders of the rotations $A$, $B$ are also factors of $n$:
$A^n \simeq 1$, $B^n \simeq 1$, so the gauge may be chosen in such a way
that $A^n = 1$, $B^n = 1$. Let $B$ be reduced to diagonal form by an
appropriate choice of our normal co-ordinate system; the elements
$b_i$ in the main diagonal are then all $n$-th roots of unity.
Equation (14.1) then yields the following conditions on the
elements of $A = \|a_{ik}\|$:

$$a_{ik}\,(b_k - \varepsilon b_i) = 0. \qquad (15.1)$$

We divide the indices $i$ and the corresponding variables $x_i$
into classes in accordance with the rule that $i$ and $k$ belong to
the same class if the quotient $b_i/b_k$ is an $m$-th root of unity,
i.e. a power of $\varepsilon$. That this process really results in such
a division into classes is shown by the fact that if $b_i/b_k$ and
$b_k/b_l$ are powers of $\varepsilon$, then $b_i/b_l$ is also. By (15.1)
$a_{ik} = 0$ if $i$ and $k$ belong to different classes; hence the
matrix $A$ is reduced in accordance with the division of the
indices into classes. But in view of the assumption that the
system $A$, $B$ was irreducible there can therefore exist but one
such class.

Having established this result, we now proceed to a finer
division into classes: $i$ and $k$ shall now be considered as
belonging to the same class if $b_i = b_k$. We arbitrarily choose as
the first of these classes that one for which $b_i = b$ and let the
second consist of those for which $b_i = \varepsilon b$, the third with
$b_i = \varepsilon^2 b$, $\cdots$, the $m$-th with $b_i = \varepsilon^{m-1} b$;
this exhausts the set, for the $(m+1)$-th class $b_i = \varepsilon^m b$
coincides with the first. Let the variables be arranged and
numbered in this order. It then follows from equation (15.1) that
all sub-matrices $(i, k)$ of the matrix $A$ are empty, i.e.
$a_{ik} = 0$, unless their row index $i$ and their column index $k$
belong to successive classes. The matrix $A$ then has the form
indicated in Fig. 3, in which all elements in the non-shaded
portions are zero (and we have taken $m = 4$). The shaded portions
are occupied by the sub-matrices $A^{(1)}, A^{(2)}, \cdots, A^{(m)}$.

[Fig. 3. Form of the matrix $A$ for $m = 4$; the sub-matrices occupy the shaded portions.]

Since $A$ is unitary the sum of the squares of the absolute values
of the elements of a row or column is 1; the same must therefore
also hold for the rows and columns of each of the sub-matrices.
The sum of the absolute values of the squares of all elements in
$A^{(1)}$ must then be equal, on the one hand, to the number of rows
and, on the other, to the number of columns; the rectangle
$A^{(1)}$ is consequently a square, and the number of indices in the
second class is equal to the number in the first class, say $d$.
By the same argument we see that the number of individuals in
each of the $m$ classes is $d$, and hence $n = md$. The figure is to
be corrected accordingly; each of the shaded matrices is now
unitary. On subjecting the variables of the first class to the
unitary transformation determined by $A^{(1)}$ the sub-matrix
$A^{(1)}$ is reduced to the $d$-dimensional unit matrix. This normal
form is undisturbed by a unitary transformation affecting the
variables of the first set and the variables of the second set
alike; we can therefore reduce the second sub-matrix $A^{(2)}$ to a
multiple of the $d$-dimensional unit matrix, and so on through the
$(m-1)$-st. The normal form so obtained is unchanged on subjecting
the variables of each class to the same $d$-dimensional unitary
transformation; we may therefore choose as this last
transformation one which reduces $A^{(m)}$ to diagonal form. But the
matrix $A$ is then decomposed into $d$ sub-matrices, as can be seen
by renumbering the variables, taking first the first members in
each set, then the second, etc. The irreducibility assumption
then tells us that there can be but one member in each set:
$d = 1$, $n = m$. Our matrices are now in the normal form:



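$$A = \begin{pmatrix} 0 & 1 & & & \\ & 0 & 1 & & \\ & & \ddots & \ddots & \\ & & & 0 & 1 \\ a & 0 & 0 & \cdots & 0 \end{pmatrix}, \qquad B = \begin{pmatrix} \varepsilon^{r} & & & \\ & \varepsilon^{r+1} & & \\ & & \ddots & \\ & & & \varepsilon^{r+n-1} \end{pmatrix};$$

all elements not explicitly indicated are zero. The exponents
in $B$ are $n$ successive integers and $\varepsilon$ is a primitive $n$-th
root of unity. Finally, the equation $A^n = 1$ yields $a = 1$. We
number the variables from $r$ on and take indices which are
congruent mod. $n$ as equal; the two correspondences are then

$$A:\ x'_k = x_{k+1}, \qquad B:\ x'_k = \varepsilon^{k} x_k.$$

On reiteration we find

$$A^s:\ x'_k = x_{k+s}, \qquad B^t:\ x'_k = \varepsilon^{kt} x_k. \qquad (15.2)$$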
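
The normal form can be checked numerically. The sketch below builds the shift matrix $A$ (with $a = 1$) and the diagonal matrix $B$ for a given $n$ and verifies $A^s B^t = \varepsilon^{st} B^t A^s$ to within rounding error. The function names and the tolerance are assumptions made for illustration, and $\varepsilon$ is taken to be $e^{2\pi i/n}$, one particular primitive $n$-th root of unity.

```python
import cmath


def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def clock_and_shift(n, r=0):
    """Return (A, B, eps) in the normal form above, with a = 1."""
    eps = cmath.exp(2j * cmath.pi / n)  # assumed choice of primitive root
    A = [[1 if k == (i + 1) % n else 0 for k in range(n)] for i in range(n)]
    B = [[eps ** (r + i) if i == k else 0 for k in range(n)]
         for i in range(n)]
    return A, B, eps


def power(m, p):
    """p-th power of a square matrix by repeated multiplication."""
    n = len(m)
    out = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    for _ in range(p):
        out = matmul(out, m)
    return out


def close(a, b, tol=1e-9):
    """Entry-wise comparison up to a small tolerance."""
    n = len(a)
    return all(abs(a[i][j] - b[i][j]) < tol
               for i in range(n) for j in range(n))


def twisted_commutation_holds(n, s=1, t=1, r=0):
    """Check A^s B^t = eps^(s t) B^t A^s numerically."""
    A, B, eps = clock_and_shift(n, r)
    As, Bt = power(A, s), power(B, t)
    left = matmul(As, Bt)
    right = [[eps ** (s * t) * x for x in row] for row in matmul(Bt, As)]
    return close(left, right)
```

With $s = t = 1$ this is just (14.1); the general case is the finite analogue of the relation between $e(\sigma P)$ and $e(\tau Q)$ used in the passage to the limit.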

The transition to continuous groups is now accomplished by
passing to the limit $n \to \infty$. Let the basis $iP$, $iQ$ of the
continuous 2-parameter Abelian rotation group be normalized in
accordance with (14.9). We identify the matrix $A$ of the above
considerations with the infinitesimal $e(\xi P)$ and $B$ with
$e(\eta Q)$ where $\xi$ and $\eta$ are real infinitesimal constants. Then
$e(\sigma P) = A^s$, $e(\tau Q) = B^t$ when in the limit
$s\xi \to \sigma$, $t\eta \to \tau$. $\varepsilon$ is now $e(\xi\eta)$, and
$e(\tau Q)$ represents the physical quantity $e(\tau q)$; the values
which it may assume are given by $e(\tau \cdot k\xi)$ where $\tau$ is real
and $k$ runs through all integral values. In other words, the
quantity $q$ may assume the values $k\xi$; $q$ may assume all real
numbers from $-\infty$ to $+\infty$. (Of course $k$ is to be considered
mod. $n$ and $k\xi$ mod. $n\xi$, but $n\xi$ is a multiple of $2\pi/\eta$
and may consequently be infinite in the limit.) We therefore
write $q$ in place of $k\xi$, where $q$ is understood to be a variable
which runs through the possible values of the physical quantity
$q$, and $\sqrt{\xi} \cdot \psi(q)$ in place of $x_k$. $\psi(q)$ is an
arbitrary function, whose values are complex numbers, which
satisfies the normalizing condition

$$\int_{-\infty}^{+\infty} \bar{\psi}(q)\,\psi(q)\,dq = 1.$$

On passing to the limit in the second equation of (15.2) we
find that the quantity $e(\tau q)$ is represented by the linear
operator

$$\psi(q) \to e(\tau q) \cdot \psi(q).$$

Similarly we find from the first equation of (15.2) that

$$\psi(q) \to \psi(q + \sigma)$$

is the operator representing $e(\sigma p)$. On returning from finite to
infinitesimal unitary transformations we find

$$q:\ \delta\psi(q) = q \cdot \psi(q), \qquad p:\ \delta\psi(q) = \frac{1}{i}\,\frac{d\psi}{dq}. \qquad (15.3)$$

We have thus finally justified the assumption from which we
started in Chapter II.
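
That the operators (15.3) reproduce the commutation rule (14.9) can be verified on polynomials. The sketch below is purely algebraic (polynomials are not normalizable wave functions) and represents $\psi(q) = \sum_k c_k q^k$ by its coefficient list; the function names are hypothetical.

```python
from itertools import zip_longest


def apply_q(psi):
    """Multiply the polynomial psi(q) by q (shift coefficients up)."""
    return [0] + list(psi)


def apply_p(psi):
    """Apply p = (1/i) d/dq to the polynomial psi(q)."""
    derivative = [k * c for k, c in enumerate(psi)][1:]
    return [-1j * c for c in derivative] or [0]


def i_times_commutator(psi):
    """Coefficients of i (pq - qp) psi."""
    pq = apply_p(apply_q(psi))
    qp = apply_q(apply_p(psi))
    return [1j * (a - b) for a, b in zip_longest(pq, qp, fillvalue=0)]


def trim(psi):
    """Drop trailing zero coefficients so that lists can be compared."""
    out = list(psi)
    while len(out) > 1 and out[-1] == 0:
        out.pop()
    return out
```

For any coefficient list `psi`, `trim(i_times_commutator(psi))` should agree with `trim(psi)`, which is the statement $i(pq - qp) = 1$ restricted to polynomials.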

The extension of these results to systems with several degrees 
of freedom causes no trouble. The kinematics of a system which
is expressed by a continuous Abelian group of rotations is
consequently determined uniquely by the number f of degrees of freedom.
The postulate of irreducibility allows us to conclude that the 
particular operators (15.3) of the Schrodinger theory are a 
necessary consequence of Heisenberg’s commutation rules. 

P. Jordan and E. Wigner have given a very elegant
group-theoretic proof that there exists but one irreducible matrix
solution of equations (14.10), i.e. that one of degree $2^f$ there
mentioned and given in greater detail at the end of § 11.
THE SYMMETRIC PERMUTATION GROUP AND THE
ALGEBRA OF SYMMETRIC TRANSFORMATIONS 

A. General Theory 

§ 1. The Group Induced in Tensor Space and the 
Algebra of Symmetric Transformations 

THE principal problem we propose to solve in this chapter
is the group-theoretic classification of the line spectra of an
atom consisting of an arbitrary number, say $f$, of electrons,
taking into account the reduction of the system space required by
the Pauli exclusion principle, and the spinning electron.
For this it is necessary to consider in detail the representations
of the symmetric group, i.e. the group $\pi_f$ of all $f!$ permutations
of $f$ things. These are most intimately related to the
representations of the group $\mathfrak{u}$ of all unitary transformations
or the group $\mathfrak{c}$ of all homogeneous linear transformations of a
space $\mathfrak{R}_n$. This connection has already been touched upon in
Chapter III, § 6: the substratum of a representation of
$\mathfrak{c}$ or $\mathfrak{u}$ consists of the linear manifold of all tensors of
order $f$ in $\mathfrak{R}_n$ which satisfy certain symmetry conditions, and
the symmetry properties of a tensor are expressed by linear
relations between it and the tensors obtained from it by the $f!$
permutations.

A tensor $F$ of order $f$ in the $n$-dimensional vector space
$\mathfrak{R} = \mathfrak{R}_n$ is defined by its $n^f$ components or, as we prefer
to say, “coefficients” $F(i_1 i_2 \cdots i_f)$; each of the indices $i$
runs from 1 to $n$. Tensors can be added and multiplied by
arbitrary numbers; hence the totality of such tensors $F$
constitute a linear “vector space” $\mathfrak{R}^f$ of $n^f$ dimensions.
Further, $F$ can be subjected to an arbitrary permutation $s$ of its
$f$ indices, which can be thought of as a permutation of the $f$
numbers $1, 2, \cdots, f$ attached to the indices $i$ in the general
component above; if $s$ is the permutation

$$1 \to 1', \quad 2 \to 2', \quad \cdots, \quad f \to f'$$

then the tensor $sF$ obtained by applying $s$ to $F$ is, by definition,
that tensor whose coefficients are

$$sF(i_1 i_2 \cdots i_f) = F(i_{1'} i_{2'} \cdots i_{f'}). \qquad (1.1)$$

It follows from this definition that for any two permutations
$s$ and $t$

$$t(sF) = (ts)F.$$
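
The definition (1.1) and the composition rule can be tried out on small tensors. In the sketch below, which is an illustration under stated assumptions rather than anything prescribed by the text, a tensor is a dictionary from index tuples to numbers, positions and index values are counted from 0, and a permutation $s$ is a tuple whose $k$-th entry is the image of $k$. The product $ts$ is taken to mean "first $s$, then $t$".

```python
from itertools import permutations, product


def apply_perm(s, F):
    """Return sF with sF(i_0, ..., i_{f-1}) = F(i_{s(0)}, ..., i_{s(f-1)})."""
    return {idx: F[tuple(idx[s[k]] for k in range(len(s)))] for idx in F}


def compose(t, s):
    """Return ts, assuming the convention (ts)(k) = t(s(k))."""
    return tuple(t[s[k]] for k in range(len(s)))


def composition_rule_holds(F, f):
    """Check t(sF) = (ts)F for all pairs of permutations of f positions."""
    perms = list(permutations(range(f)))
    return all(apply_perm(t, apply_perm(s, F)) == apply_perm(compose(t, s), F)
               for s in perms for t in perms)


def example_tensor(n, f):
    """An arbitrary tensor with pairwise distinct integer coefficients."""
    return {idx: sum(v * n ** k for k, v in enumerate(idx))
            for idx in product(range(n), repeat=f)}
```

With the opposite convention for the product (first $t$, then $s$) the rule would read $t(sF) = (st)F$, so the check also documents which convention is in force.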

A linear correspondence $F \to F'$:

$$F'(i_1 \cdots i_f) = \sum_{(k)} a(i_1 \cdots i_f;\, k_1 \cdots k_f)\, F(k_1 \cdots k_f) \qquad (1.2)$$

is said to be symmetric if the coefficient

$$a(i_1 \cdots i_f;\, k_1 \cdots k_f)$$

is unaltered on subjecting the sub-indices $1, 2, \cdots, f$ of both the
indices $i$ and $k$ to the same arbitrary permutation $s$. The
processes of addition, multiplication by a number and permutation,
in the sense defined above, applied to tensors are invariant
under symmetric linear transformations; and conversely, any
transformation of tensor space under which these processes
are invariant is linear and symmetric. The totality of symmetric
correspondences constitutes an algebra $\Sigma$: if $A$ and $B$ are
elements of $\Sigma$ then $A + B$, $AB$ and $cA$ ($c$ an arbitrary number)
are also. The problem with which we shall concern ourselves
is the reduction of $\mathfrak{R}^f$ into linear sub-spaces $\mathfrak{P}$ which are
invariant with respect to $\Sigma$, i.e. with respect to all symmetric
linear transformations. Wherever in the following we employ the
terms invariant, irreducible, etc., in referring to the tensor space
$\mathfrak{R}^f$, they are to be interpreted with respect to the algebra $\Sigma$.

We give a brief résumé of our terminology. We are dealing
with a vector space $\mathfrak{R}$ and a system $\Sigma$ of linear correspondences

$$\mathfrak{x} \to A\mathfrak{x}$$

of $\mathfrak{R}$ on itself; we may often prefer to use the term “linear
projection” instead of “linear correspondence” (operator) in
order to bring out the fact that the correspondence need not
be one-to-one. A (linear) sub-space $\mathfrak{P}$ of $\mathfrak{R}$ is invariant if an
arbitrary projection $A$ of the system $\Sigma$ sends every vector
$\mathfrak{x}$ of $\mathfrak{P}$ over into a vector of $\mathfrak{P}$. $\mathfrak{P}$ is irreducible if it
contains no invariant sub-space other than itself and the space 0
consisting only of the vector 0. We shall always understand by
a complete reduction $\mathfrak{P}_1 + \mathfrak{P}_2$ of an invariant sub-space
$\mathfrak{P}$ a complete reduction into two linearly independent invariant
sub-spaces $\mathfrak{P}_1$, $\mathfrak{P}_2$, even when this is not explicitly stated. A
linear projection $\mathfrak{x} \to \mathfrak{x}'$ of the invariant sub-space $\mathfrak{P}$ on
the invariant sub-space $\mathfrak{P}'$ is similar if two vectors $\mathfrak{x}$ and
$\mathfrak{y}$ of $\mathfrak{P}$ which are related by a correspondence $A$ of the
system: $\mathfrak{y} = A\mathfrak{x}$, are always projected into two vectors
$\mathfrak{x}'$ and $\mathfrak{y}'$ of $\mathfrak{P}'$ which are related by the same $A$:
$\mathfrak{y}' = A\mathfrak{x}'$. $\mathfrak{P}$ and $\mathfrak{P}'$ are similar or
equivalent: $\mathfrak{P} \sim \mathfrak{P}'$, if a one-to-one linear and similar
correspondence can be set up between $\mathfrak{P}$ and $\mathfrak{P}'$. In
particular, these concepts are to be applied to the case in which
the vector space $\mathfrak{R}$ is the tensor space $\mathfrak{R}^f$ of $n^f$
dimensions and $\Sigma$ is the totality of symmetric transformations.

In quantum theory the state of a system consisting of $f$
equivalent individuals (electrons) with a system-space $\mathfrak{R}$ is
described by a tensor of order $f$ in $\mathfrak{R}$. The energy necessarily
depends on each of the $f$ individuals in exactly the same way;
hence the Hermitian operator which represents the energy is
necessarily symmetric in our sense. The fundamental dynamical
law therefore allows us to conclude that an invariant sub-space
$\mathfrak{P}$ of $\mathfrak{R}^f$ has the property that if the tensor describing the
state of the system is at any time in $\mathfrak{P}$ no influence whatever
can drive it out. A complete reduction of $\mathfrak{R}^f$ into invariant
sub-spaces $\mathfrak{P}$ implies a corresponding reduction of the operator
representing the energy; hence the term spectrum is reduced into
classes of terms belonging to the various $\mathfrak{P}$ such that the
members of one class can under no conditions combine with the
members of another. Naturally this division into non-combining
classes is to be carried as far as possible. But this problem is
exactly the one proposed above, the only difference being that we
are here only concerned with the totality of symmetric Hermitian
operators. However, this restriction is quite irrelevant, for
any symmetric operator can be written in the form
$A = A_1 + iA_2$ where

$$A_1 = \tfrac{1}{2}(A + \tilde{A}), \qquad A_2 = \tfrac{1}{2i}(A - \tilde{A})$$

($\tilde{A}$ denoting the Hermitian conjugate of $A$) are both Hermitian.

On going over to a new co-ordinate system in the fundamental
vector space $\mathfrak{R}$ by means of a non-singular transformation

$$x'_i = \sum_{k=1}^{n} a(ik)\, x_k \qquad (1.3)$$

the coefficients of a tensor $F$ are transformed in accordance with

$$F'(i_1 i_2 \cdots i_f) = \sum_{(k)} a(i_1 k_1)\, a(i_2 k_2) \cdots a(i_f k_f) \cdot F(k_1 k_2 \cdots k_f). \qquad (1.4)$$

The transformation (1.3) in vector space induces the symmetric
transformation (1.4) in tensor space. These induced
transformations, which we shall call “special symmetric
transformations,” constitute a group $\Sigma_0$ which is isomorphic with
the complete linear group $\mathfrak{c} = \mathfrak{c}_n$; this representation of
$\mathfrak{c}$ was previously denoted by $(\mathfrak{c})^f$. The group $\Sigma_0$ is
contained in the algebra $\Sigma$. Hence a sub-space $\mathfrak{P}$ of
$\mathfrak{R}^f$ which is invariant under the algebra $\Sigma$ is a fortiori
invariant under the group $\Sigma_0$. That the converse of this result
is also valid is not so self-evident. Nevertheless for all
questions involving only linearity $\Sigma_0$ can be replaced by the
more extended $\Sigma$, for $\Sigma$ is what we might call an enveloping
algebra for the group $\Sigma_0$; by this we mean that any symmetric
transformation can be expressed as a linear combination of
appropriately chosen special symmetric transformations. To
show this we prove the theorem:

A homogeneous linear relation

$$\sum_{(i;k)} c(i_1 \cdots i_f;\, k_1 \cdots k_f)\, x(i_1 \cdots i_f;\, k_1 \cdots k_f) = 0 \qquad (1.5)$$

is satisfied identically by all symmetric transformations

$$\|x(i_1 \cdots i_f;\, k_1 \cdots k_f)\|,$$

if it is satisfied by all special symmetric transformations, i.e. if
the equation

$$\sum_{(i;k)} c(i_1 \cdots i_f;\, k_1 \cdots k_f)\, x(i_1 k_1) \cdots x(i_f k_f) = 0 \qquad (1.6)$$

is satisfied for all values of the variables $x(ik)$ for which
the determinant $|x(ik)| \neq 0$.

Proof. Denoting the pair $(ik)$ of indices by $j$ and calling the
$n^2 = m$ values of $j$ simply $1, 2, \cdots, m$, the left-hand side of
(1.6) is a homogeneous polynomial of order $f$ in the $m$ variables
$x(ik) = x_j$:

$$\phi(x_1 x_2 \cdots x_m) = \sum_{(f)} b(f_1, f_2, \cdots, f_m)\, x_1^{f_1} x_2^{f_2} \cdots x_m^{f_m}$$

where $f_1 + f_2 + \cdots + f_m = f$ and $b(f_1, f_2, \cdots, f_m)$ is
$\dfrac{f!}{f_1!\, f_2! \cdots f_m!}$ times that coefficient
$c(j_1 j_2 \cdots j_f)$ whose indices contain $j = 1$ $f_1$ times,
$j = 2$ $f_2$ times, etc. On denoting that variable
$x(j_1 j_2 \cdots j_f)$ in which the indices $j = 1, 2, \cdots, m$ occur
$f_1, f_2, \cdots, f_m$ times by $y(f_1, f_2, \cdots, f_m)$ the left-hand
side of equation (1.5) becomes

$$\sum_{(f)} b(f_1, f_2, \cdots, f_m)\, y(f_1, f_2, \cdots, f_m).$$

The determinant of the $x(ik)$ is a certain polynomial
$D(x_1 x_2 \cdots x_m)$ in the variables $x_j$. Our assertion is thus
reduced to the well-known theorem of algebra: let $\phi(x)$, $D(x)$ be
two polynomials in the variables $x_1 x_2 \cdots x_m$, the second of
which does not vanish algebraically, i.e. its coefficients do not
all vanish. If $\phi(x)$ is zero for all values of the variables for
which the value of $D(x) \neq 0$, then $\phi(x)$ vanishes algebraically.

This theorem is proved for a single variable $x$ as follows.
If $\phi(x)$ does not vanish algebraically it has a definite degree
$p \geq 0$; let $q$ be the degree of $D(x)$. There are then at most
$p + q$ values of the variable $x$ for which $\phi(x)$ or $D(x)$ vanish;
for any one of the remaining infinitude of possible values of
$x$ neither $\phi(x)$ nor $D(x)$ can vanish, contrary to assumption.
The theorem is readily extended to polynomials in any number
of variables by mathematical induction. The principal point
is that the analytical vanishing of a polynomial for all values of
the independent variables implies that it vanishes algebraically.

In quantum theory the vector space $\mathfrak{R}$ is unitary; the
transition from one normal co-ordinate system to another such is
accomplished by an arbitrary unitary transformation (1.3).
The transformations thus induced constitute a sub-group
of $\Sigma_0$ which is isomorphic to the unitary group $\mathfrak{u}_n$, i.e. the
representation $(\mathfrak{u})^f$ of the unitary group. I assert that a
sub-space $\mathfrak{P}$ of $\mathfrak{R}^f$ which is invariant and irreducible with
respect to $\Sigma$ remains irreducible not only under the group
$\Sigma_0$ but under this more restricted unitary sub-group as well. To
prove this we must show that the identity (1.5) holds even when we
assume only that (1.6) is true for those values of the variables
$x(ik)$ with unitary matrix.

One of the most natural proofs of the above theorem concerning
the formal vanishing of a form $\phi$ of order $f$ depends on
the process of “polarization”: we assign arbitrary infinitesimal
increments $dx_j$ to the values of the variables $x_j$; the identical
vanishing of $\phi$ then allows us to conclude that the differential

$$d\phi = \sum_j \frac{\partial \phi}{\partial x_j}\, dx_j$$

vanishes for arbitrary values of $x_j$ and $dx_j$. This procedure
also leads us to the desired conclusion in the case under
consideration. Denoting by $\Phi$ the matrix obtained by transposing
rows and columns in $\left\|\dfrac{\partial \phi}{\partial x(ik)}\right\|$, we have

$$d\phi = \sum_{i,k} \frac{\partial \phi}{\partial x(ik)}\, dx(ik) = \mathrm{tr}\,(\Phi\, dX) = 0$$

where $X$, $X + dX$ are two arbitrary neighbouring unitary
matrices. In order that this be the case we must have

$$dX = iX \cdot \delta X$$

where $\delta X$ is an arbitrary Hermitian matrix: the “rotation”
$X + dX$ is obtained by following up the rotation $X$ with the
infinitesimal rotation $1 + i \cdot \delta X$. But the equation

$$\mathrm{tr}\,(\Phi X \cdot \delta X) = 0$$

implies the vanishing of $\Phi X$. This is seen immediately from
the fact that a linear form in the variables $y_{ik} = \delta x(ik)$
vanishes identically if it vanishes for all values satisfying the
Hermitian condition $\bar{y}_{ki} = y_{ik}$; indeed, any matrix
$Y = \|y_{ik}\|$ can be written in the form $Y_1 + iY_2$ where $Y_1$
and $Y_2$ are Hermitian. On multiplying $\Phi X = 0$ on the right by
$X^{-1}$ we find $\Phi = 0$: all derivatives
$\dfrac{\partial \phi}{\partial x(ik)}$ vanish in the same sense as $\phi$ itself,
i.e. for arbitrary $x(ik)$ whose matrix is unitary. But these
derivatives are forms of order $f - 1$; the truth of our assertion
above is thus proved by mathematical induction.

Every invariant sub-space $\mathfrak{P}$ of $\mathfrak{R}^f$ is the representation
space of representations of the groups $\mathfrak{c}$ and $\mathfrak{u}$ which are
contained in $(\mathfrak{c})^f$ and $(\mathfrak{u})^f$ respectively. Hence the above
results prove that if $\mathfrak{P}$ is irreducible these representations
are also.

§ 2. Symmetry Classes of Tensors 

One of the most natural methods of obtaining invariant
manifolds of tensors $F$ consists in subjecting $F$ to linear symmetry
conditions of the form

$$\sum_s a(s) \cdot sF = 0. \qquad (2.1)$$

This suggests introducing the symmetry operator

$$a = \sum_s a(s) \cdot s. \qquad (2.2)$$

Such operators can be added and multiplied with arbitrary
numbers, and two operators $a$, $b$ can be applied successively
with the same result as the symmetry operator $c = ba$ defined by

$$c(s) = \sum_{tt' = s} b(t)\, a(t'). \qquad (2.3)$$
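
The multiplication rule (2.3) can be checked against the successive application of two operators to a tensor. The following sketch represents an element of the algebra as a dictionary from permutations to coefficients. All names are hypothetical, positions are counted from 0, and the product $tt'$ is assumed to mean "first $t'$, then $t$".

```python
def compose(t, s):
    """Return ts, assuming the convention (ts)(k) = t(s(k))."""
    return tuple(t[s[k]] for k in range(len(s)))


def apply_perm(s, F):
    """Return sF with sF(i_0, ..., i_{f-1}) = F(i_{s(0)}, ..., i_{s(f-1)})."""
    return {idx: F[tuple(idx[s[k]] for k in range(len(s)))] for idx in F}


def apply_element(a, F):
    """Apply the symmetry operator a = sum_s a(s) s to the tensor F."""
    out = {idx: 0 for idx in F}
    for s, coeff in a.items():
        sF = apply_perm(s, F)
        for idx in out:
            out[idx] += coeff * sF[idx]
    return out


def multiply(b, a):
    """Product c = ba with c(s) = sum over t t' = s of b(t) a(t')."""
    c = {}
    for t, bt in b.items():
        for t2, at in a.items():
            s = compose(t, t2)
            c[s] = c.get(s, 0) + bt * at
    return c
```

Applying `multiply(b, a)` to a tensor should give the same result as applying `a` first and `b` afterwards, which is the content of the statement $c = ba$.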

In other words, we are here led in a most natural way to the
algebra $\rho$ of the symmetric group $\pi = \pi_f$ of all permutations $s$.
The elements of this algebra, which constitute an $f!$-dimensional
linear space $\mathfrak{r}$, appear as operators which can be applied to
tensors of order $f$. We may call the numbers $a(s)$ appearing
in (2.2) the components of the element $a$. In particular, $a$ is
an Hermitian operator in the tensor space $\mathfrak{R}^f$ if it is a real
element, i.e. if it coincides with its Hermitian conjugate $\tilde{a}$
defined by the equation

$$\tilde{a}(s) = \bar{a}(s^{-1}). \qquad (2.4)$$

Hence these real symmetry operators represent physical
quantities of the physical system consisting of $f$ equivalent
individuals, whose total system space is $\mathfrak{R}^f$; quantities of this
kind are unknown in classical physics and cannot be pictured in
terms of the usual spatial and temporal models.

(2.1) or

$$\sum_s a(s)\, x(s) = 0$$

is a linear condition which is imposed on the element $x = \mathbf{F}$
defined by $x(s) = sF$. A symmetry class is defined by one
or more equations of this kind; we are thus led to the definition:

Each linear sub-space $\mathfrak{p}$ of $\mathfrak{r}$ determines a symmetry class
$\mathfrak{P}$ of tensors. $F$ belongs to $\mathfrak{P}$ when the corresponding
symmetry quantity or element $\mathbf{F}$ is in $\mathfrak{p}$. It will be found
convenient to denote the process by which $\mathfrak{P}$ is generated from
$\mathfrak{p}$ by a symbol; we write

$$\mathfrak{P} = \sharp\mathfrak{p}.$$

If the reader finds it difficult to operate with elements $\mathbf{F}$
whose components $sF$ are tensors rather than numbers he may
replace the tensor by the totality of its coefficients
$F(i_1 i_2 \cdots i_f)$ and $\mathbf{F}$ by the elements

$$x = \mathbf{F}(i_1 i_2 \cdots i_f)$$

associated with each definite set of indices $(i_1 i_2 \cdots i_f)$; this
$x$ is defined by the equation

$$x(s) = sF(i_1 i_2 \cdots i_f).$$

The requirement that $\mathbf{F}$ belong to $\mathfrak{p}$ means that
$\mathbf{F}(i_1 i_2 \cdots i_f)$ belongs to $\mathfrak{p}$ for all the $n^f$ possible
combinations of the indices $i$. That the symmetry class
$\mathfrak{P} = \sharp\mathfrak{p}$ is invariant with respect to all symmetric
transformations (1.2) is due to the fact that (1.2) implies the
corresponding equation for the elements $\mathbf{F}$, $\mathbf{F}'$:
$\mathbf{F}'(i_1 i_2 \cdots i_f)$ is a linear combination of the elements
$\mathbf{F}(k_1 k_2 \cdots k_f)$ associated with the various combinations
$(k_1 k_2 \cdots k_f)$ of indices $k$.

If $\mathbf{F}$ belongs to $\mathfrak{p}$ then $a \cdot \mathbf{F}$ does also, where $a$ is
any element whatever of the algebra. To show this we note that the
$s$-component of

$$\mathbf{H}(i_1 \cdots i_f) = a \cdot \mathbf{F}(i_1 \cdots i_f)$$

is given by

$$\sum_r a(r^{-1}) \cdot rsF(i_1 \cdots i_f) = \sum_r a(r^{-1}) \cdot sF(k_1 \cdots k_f)$$

where the $k_1, \cdots, k_f$ are obtained from $i_1, \cdots, i_f$ by the
permutation $r$. Hence $\mathbf{H}(i_1, \cdots, i_f)$ is a linear combination
of those $\mathbf{F}(k_1 \cdots k_f)$ whose indices $k$ are obtained by a
permutation of the indices $i$.

The principal question now is whether every invariant
sub-space $\mathfrak{P}$ can be generated from a $\mathfrak{p}$ by the process
$\sharp$, and further, whether or to what extent this generating
$\mathfrak{p}$ is uniquely determined by $\mathfrak{P}$. The answer is perhaps
best expressed with the aid of the inverse process $\natural$ which
generates a $\mathfrak{p}$ from the given $\mathfrak{P}$. The following
geometrical analogy may be useful in enabling the reader to
understand the situation with which we are dealing. Let the points
$x$ of a plane with a fixed centre correspond to the elements of the
algebra $\rho$ and the line segments $F$ going out from the origin
correspond to the tensors. On contracting the entire plane,
leaving the centre invariant, in the fixed ratio $r$
($0 \leq r \leq 1$) the point $x$ goes into the point $rx$ and the
segment $F$ into the segment $rF$; this contraction of segments
shall be the analogue of the symmetrical transformations of
tensors. $\mathfrak{P}$ will now denote an “invariant” set of segments,
i.e. a set such that if it contains the segment $F$ it also contains
all the contracted segments $rF$. Just as we associated the
symmetry elements $\mathbf{F}(i_1 \cdots i_f)$ with the tensor $F$ we now
associate with the segment $F$ the continuum of points $F(r)$ of $F$;
$F(r)$ is the end point of the segment $rF$. Let $\mathfrak{p}$ be any set
of points; the segment $F$ will then be included in the set
$\mathfrak{P}$ if and only if all its points $F(r)$ are in $\mathfrak{p}$.
Obviously the only segment sets $\mathfrak{P}$ which can be obtained in
this way are those which are invariant, and all such invariant
sets can be so obtained. Only the “core” $\mathfrak{p}_0$ of the point set
$\mathfrak{p}$ is essential to this construction; $\mathfrak{p}_0$ consists only
of those points $x$ such that $rx$ belongs to $\mathfrak{p}$ for all $r$ (in
the interval $0 \leq r \leq 1$). $\mathfrak{p}_0$ is invariant in the sense
that with $x$ all $rx$ belong to $\mathfrak{p}_0$. That only the core
$\mathfrak{p}_0$ is essential means that our construction generates the
same segment set $\mathfrak{P}$ from two point sets $\mathfrak{p}$, $\mathfrak{p}'$ if
these latter have the same core; hence we can restrict ourselves
ab initio to the consideration of invariant point sets
$\mathfrak{p} = \mathfrak{p}_0$. It is extraordinarily easy to find the point set
$\mathfrak{p}$ which generates a given segment set $\mathfrak{P}$: we include in
$\mathfrak{p}$ those and only those points lying on the segments of
$\mathfrak{P}$, and this $\mathfrak{p}$ is automatically invariant.

If the reader will think through this geometrical illustration,
which we have formulated here in such a pedantic manner, he
will have no trouble in understanding the analogous situation
for tensors and symmetry elements. A linear sub-space $\mathfrak{p}$ of
$\mathfrak{r}$ is to be called invariant if all elements $ax$ are in
$\mathfrak{p}$, where $x$ is an arbitrary element of $\mathfrak{p}$ and $a$ is any
element whatever.* Hence such a $\mathfrak{p}$ is invariant under the
totality of correspondences of the form

$$(a):\ x \to x' = ax. \qquad (2.5)$$

On associating this correspondence $(a)$ of $\mathfrak{r}$ on itself with the
element $a$ we obviously obtain a representation of the algebra
$\rho$ (and therefore of the group $\pi_f$); it is called the regular
representation. ($\mathfrak{r}$ appears here twice: once as the
representation space and again as the algebra $\rho$ represented in
this space; the first will be expressed by the German letter
$\mathfrak{r}$, the second by the Greek $\rho$. We are here doing the same
thing as in III, § 2, where we obtained a realization of the group
$\mathfrak{g}$ by associating with the element $a$ of $\mathfrak{g}$ the
correspondence $s \to s' = as$ of the group manifold on itself.) This
regular representation supplies us with material from which we can
construct all (and hence in particular the inequivalent
irreducible) representations of the algebra $\rho$. When we use the
terms invariant, irreducible, etc., in $\mathfrak{r}$ they will always
refer to the algebra of all correspondences $(a)$ of $\mathfrak{r}$ on
itself, which is simply isomorphic with the algebra $\rho$ of all
symmetry elements $a$. $\mathfrak{p}$ being an invariant sub-space of
$\mathfrak{r}$, we shall always refer to the representation induced in
$\mathfrak{p}$ by the regular representation simply as the regular
representation in $\mathfrak{p}$; it associates with each element $a$ the
correspondence (2.5) of $\mathfrak{p}$ on itself. The equation $x' = ax$
is, in terms of components,

$$x'(s) = \sum_r a(r^{-1})\, x(rs).$$

Let $x$ be an arbitrary element of $\mathfrak{p}$; the requirement that
$\mathfrak{p}$ be invariant allows us to conclude that the element $x'$
defined by $x'(s) = x(rs)$ is also in $\mathfrak{p}$, where $r$ is any fixed
permutation.

Let $\mathfrak{p}$ be an arbitrary sub-space of $\mathfrak{r}$; we say that $x$
belongs to the core $\mathfrak{p}_0$ of $\mathfrak{p}$ if and only if all
quantities of the form $ax$ belong to $\mathfrak{p}$; this $\mathfrak{p}_0$ is
invariant. We thus have the theorem that two linear sub-spaces
$\mathfrak{p}$, $\mathfrak{p}'$ generate the same symmetry class
$\mathfrak{P} = \sharp\mathfrak{p} = \sharp\mathfrak{p}'$ of tensors if they have the same
core. We may therefore restrict ourselves ab initio to the
consideration of invariant sub-spaces $\mathfrak{p}$.

* This “invariant sub-space” is not the same as an “invariant
sub-algebra” as defined in Chap. III, § 13; to conform with our
previous nomenclature it should be called a “left-invariant
sub-algebra.”
It is possible that certain relations (2.1) will be satisfied by
all tensors. Let $\mathfrak{r}_0$ denote the smallest sub-space of $\mathfrak{r}$
which contains the elements $\mathbf{F}(i_1 i_2 \cdots i_f)$ associated with
all tensors $F$ and all values of the indices $(i_1 i_2 \cdots i_f)$.
Then $\mathfrak{p}$ generates the same $\mathfrak{P} = \sharp\mathfrak{p}$ as the
intersection of $\mathfrak{p}$ with $\mathfrak{r}_0$; it is therefore natural to
restrict ourselves further to the consideration of invariant
sub-spaces $\mathfrak{p}$ of $\mathfrak{r}_0$. These remarks are not applicable
if the dimensionality $n \geq f$, for certainly the $f!$ coefficients

$$sF(1, 2, \cdots, f) = F(1', 2', \cdots, f')$$

of the arbitrary tensor $F$ are independent. But the situation
is different in case $n < f$: for example, let $\delta_s = \pm 1$
according as $s$ is an even or an odd permutation; then

$$\sum_s \delta_s \cdot sF$$

is an anti-symmetric tensor and must therefore vanish in case
the dimensionality $n$ is less than the order $f$.
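
The vanishing of the alternating sum for $n < f$ is easily seen on an example. The sketch below, with hypothetical function names and 0-based indices, forms $\sum_s \delta_s \cdot sF$ for a tensor stored as a dictionary from index tuples to integers; the sample tensor is an arbitrary choice made only for illustration.

```python
from itertools import permutations, product


def apply_perm(s, F):
    """Return sF with sF(i_0, ..., i_{f-1}) = F(i_{s(0)}, ..., i_{s(f-1)})."""
    return {idx: F[tuple(idx[s[k]] for k in range(len(s)))] for idx in F}


def sign(s):
    """Return +1 for an even permutation and -1 for an odd one."""
    inversions = sum(1 for i in range(len(s))
                     for j in range(i + 1, len(s)) if s[i] > s[j])
    return -1 if inversions % 2 else 1


def alternating_sum(F, f):
    """Return the tensor sum_s delta_s * sF over all permutations s."""
    out = {idx: 0 for idx in F}
    for s in permutations(range(f)):
        sF = apply_perm(s, F)
        for idx in out:
            out[idx] += sign(s) * sF[idx]
    return out


def sample_tensor(n):
    """An arbitrary order-3 tensor that is not symmetric in its indices."""
    return {idx: (idx[0] + 1) * (idx[1] + 1) ** 2 * (idx[2] + 1) ** 3
            for idx in product(range(n), repeat=3)}
```

For $n = 2$, $f = 3$ every index triple contains a repeated value, so each coefficient of the alternating sum cancels in pairs; for $n = 3$ the coefficient with three distinct indices need not vanish.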

We can at most hope that conversely $\mathfrak{p}$ is uniquely determined
by $\mathfrak{P}$ if we restrict ourselves to invariant sub-spaces
$\mathfrak{p}$ which are contained in $\mathfrak{r}_0$. In order to prove that this
is indeed the case we attempt to find the inverse process which
leads from $\mathfrak{P}$ to $\mathfrak{p}$, following the programme outlined by
the geometrical analogy considered above. In case $n \geq f$ this is
readily done as follows: if $F$ is any tensor in $\mathfrak{P}$ we let the
element $x = \mathbf{F}(1, 2, \cdots, f)$ in $\mathfrak{r}$ correspond to it;
$\mathfrak{p}$ consists of all the elements $x$ so obtained. But in order
to obtain a method which is also applicable to the case $n < f$ we
must alter the procedure. We understand by
$\mathfrak{p} = \natural\mathfrak{P}$ the smallest linear manifold containing
the totality of elements $\mathbf{F}(i_1, i_2, \cdots, i_f)$ associated with
all possible tensors $F$ of $\mathfrak{P}$ and all possible combinations of
indices $(i_1 i_2 \cdots i_f)$. If the tensors $F_\alpha$ constitute a
basis for $\mathfrak{P}$, $\mathfrak{p}$ consists of all elements of the form

$$x = \sum_\alpha \sum_{(i)} c_\alpha(i_1 i_2 \cdots i_f) \cdot \mathbf{F}_\alpha(i_1 i_2 \cdots i_f). \qquad (2.6)$$

That such a $\mathfrak{p}$ is invariant has already been shown above, for
if $x = \mathbf{F}(i_1 i_2 \cdots i_f)$ the element $x'$ defined by
$x'(s) = x(rs)$ is equal to $\mathbf{F}(k_1 k_2 \cdots k_f)$ where
$k_1 k_2 \cdots k_f$ are obtained from $i_1 i_2 \cdots i_f$ by the fixed
permutation $r$.

We now denote the $\mathfrak{r}_0$ introduced above by $\natural\mathfrak{R}^f$; it coincides
with the entire space $\mathfrak{r}$ when $n \geq f$. Let the symbol $\subseteq$
denote “is contained in”; the following results then follow
immediately from the definitions: If $\mathfrak{p}$ is a linear sub-space of
$\mathfrak{r}$ and $\mathfrak{P} = \sharp\mathfrak{p}$, then $\natural\mathfrak{P} \subseteq \mathfrak{p}$. If $\mathfrak{P}$ is any linear sub-space of
$\mathfrak{R}^f$ and $\mathfrak{p} = \natural\mathfrak{P}$, then conversely $\mathfrak{P} \subseteq \sharp\mathfrak{p}$. We can at most
expect that the symbol $\subseteq$ can be replaced by $=$ if in the first
theorem $\mathfrak{p}$ is an invariant sub-space of $\mathfrak{r}_0$ and in the second if
$\mathfrak{P}$ is an invariant sub-space of $\mathfrak{R}^f$. That these converse theorems
are in fact true under these limitations will be proved in § 4.

§ 3. Invariant Sub-spaces in Group Space 

We are in need of a fundamental theorem concerning the
algebra of a group as a preparation for carrying through the
investigation proposed above; we here prove this theorem for
a general finite group. However, we do not alter the notation,
so here $\pi$ denotes any finite group of order $h$.

Theorem (3.1). If $\mathfrak{p}$ is an invariant sub-space of $\mathfrak{r}$ there exists
an element $e$ of the group algebra having the following two properties:
(1) every element of the form $xe$ belongs to $\mathfrak{p}$, (2) every
element $x$ of $\mathfrak{p}$ satisfies the equation $xe = x$.

In particular (1) implies that $e = 1e$ itself belongs to $\mathfrak{p}$,
and hence by (2) $ee = e$; $e$ is idempotent. It is a “generating
unit” of $\mathfrak{p}$ in the sense that $\mathfrak{p}$ consists of all elements of the
form $xe$.

Proof. Let $e_1, e_2, \cdots, e_h$ be a co-ordinate system in the
vector space $\mathfrak{r}$ which is adapted to the $g$-dimensional sub-space
$\mathfrak{p}$ in such a way that $\mathfrak{p}$ is the linear set defined by $e_1, e_2, \cdots, e_g$.
The parallel projection which transforms

$$x = x_1 e_1 + \cdots + x_h e_h \quad \text{into} \quad x' = x_1 e_1 + \cdots + x_g e_g$$

has the two properties (1) it projects every $x$ into an $x'$ lying in
$\mathfrak{p}$, and (2) within $\mathfrak{p}$ it is the identity. In the original co-ordinate
system defined by the simple elements $s$ of the algebra this
projection is given by

$$x'(s) = \sum_t d(s, t)\, x(t),$$

where the matrix $d(s, t)$ is necessarily of the form

$$d(s, t) = e_1(s)\check{e}_1(t) + \cdots + e_g(s)\check{e}_g(t)$$

and the $\check{e}_i(s)$ are defined by

$$\sum_s \check{e}_i(s)\, e_k(s) = \delta_{ik} \quad (i, k = 1, 2, \cdots, g).$$

The fact that $\mathfrak{p}$ is invariant implies that if $x$ is in $\mathfrak{p}$ then the
element $x_r$ defined by $x_r(s) = x(rs)$ is also in $\mathfrak{p}$. Consequently
the projection with the matrix $d(rs, rt)$ has the same two properties
(1) and (2), where $r$ is any fixed permutation (i.e. element
of the group $\pi$) whatever. Hence the assertions also hold for
the correspondence with the matrix

$$e(s, t) = \frac{1}{h} \sum_r d(rs, rt) \tag{3.2}$$

obtained by summing over all elements $r$ of the group. This
matrix satisfies the equation

$$e(rs, rt) = e(s, t),$$

whence $e(s, t)$ depends only on the combination $t^{-1}s$:
$e(s, t) = e(t^{-1}s)$. The linear projection

$$x'(s) = \sum_t e(s, t)\, x(t)$$

may therefore be written briefly $x' = xe$, which proves the
validity of the theorem.

Let the invariant sub-space $\mathfrak{p}$ be completely reduced into two
invariant sub-spaces: $\mathfrak{p} = \mathfrak{p}_1 + \mathfrak{p}_2$, and let $e$ be the generating
unit of $\mathfrak{p}$. Any element in $\mathfrak{p}$ can be written as the sum of
its components in $\mathfrak{p}_1$ and $\mathfrak{p}_2$; hence in particular $e = e_1 + e_2$.
From this it follows that for an arbitrary element $x$ of $\mathfrak{p}$

$$x = xe = xe_1 + xe_2.$$

But since $x_1 = xe_1$ is in $\mathfrak{p}_1$ and $x_2 = xe_2$ is in $\mathfrak{p}_2$, $x_1$ and $x_2$
are the (unique) components of $x$ in $\mathfrak{p}_1$ and $\mathfrak{p}_2$. These two
components for the element $e_1$ are obviously $e_1$ and $0$, whence

$$e_1 e_1 = e_1, \quad e_1 e_2 = 0;$$

similarly

$$e_2 e_1 = 0, \quad e_2 e_2 = e_2.$$

Hence $e_1$, $e_2$ are the generating idempotent units of $\mathfrak{p}_1$, $\mathfrak{p}_2$
respectively; they are “independent” in the sense of the
equations

$$e_1 e_2 = 0, \quad e_2 e_1 = 0.$$

On completely reducing $\mathfrak{p}$ into any number of components,
$\mathfrak{p} = \sum_i \mathfrak{p}_i$, the generating unit $e$ of $\mathfrak{p}$ is decomposed into

$$e = \sum_i e_i,$$

the components of which satisfy the analogous equations

$$e_i e_k = 0 \ (i \neq k), \quad e_i e_i = e_i.$$


The existence of the generating unit offers a means of obtaining
a new and simpler proof of the fact that reducibility
implies complete reducibility:

Theorem (3.3). If $\mathfrak{p}$, $\mathfrak{p}_1$ are invariant and $\mathfrak{p}_1 \subseteq \mathfrak{p}$, then $\mathfrak{p}$ can
be reduced into $\mathfrak{p}_1 + \mathfrak{p}_2$ in such a way that $\mathfrak{p}_2$ is also invariant.

Proof. Let $e_1$ be the generating unit of $\mathfrak{p}_1$. We decompose
every element $x$ of $\mathfrak{p}$ in accordance with the equation

$$x = xe_1 + (x - xe_1). \tag{3.4}$$

The first component $x_1 = xe_1$ lies in $\mathfrak{p}_1$ and the second

$$x_2 = x - xe_1$$

runs through a certain linear sub-space $\mathfrak{p}_2$ of $\mathfrak{p}$ when $x$ runs
through all elements of $\mathfrak{p}$. This sub-space $\mathfrak{p}_2$ is also invariant,
for

$$ax_2 = ax - (ax)e_1$$

and $ax$ is in $\mathfrak{p}$ if $x$ is. The elements $x_1$, $x_2$ of $\mathfrak{p}_1$, $\mathfrak{p}_2$ respectively
satisfy the equations

$$x_1 e_1 = x_1, \quad x_2 e_1 = 0.$$

From this it follows that the sum of an element $x_1$ of $\mathfrak{p}_1$ and an
element $x_2$ of $\mathfrak{p}_2$ cannot vanish unless both $x_1$ and $x_2$ also vanish;
hence $\mathfrak{p}_1$ and $\mathfrak{p}_2$ are independent. To prove this we merely note
that on multiplying $x_1 + x_2 = 0$ on the right by $e_1$ we find $x_1 e_1 = x_1 = 0$.
Equation (3.4) represents the reduction of any element of $\mathfrak{p}$
into its components in $\mathfrak{p}_1$ and $\mathfrak{p}_2$.

Any idempotent element $e$ generates an invariant sub-space
$\mathfrak{p}_e$ consisting of elements of the form $xe$. If $e_1$, $e_2$ are two
independent idempotent elements ($e_1 e_2 = 0$, $e_2 e_1 = 0$) then the
sub-spaces $\mathfrak{p}_1$, $\mathfrak{p}_2$ which they generate are independent, and the
idempotent element $e = e_1 + e_2$ generates $\mathfrak{p} = \mathfrak{p}_1 + \mathfrak{p}_2$. An
idempotent element $e$ is said to be primitive if it can only be
expressed as the sum of two idempotent elements $e_1 + e_2$ if
one of the summands is 0 (and the other $e$). In order that $\mathfrak{p}_e$
be irreducible it is necessary and sufficient that $e$ be primitive.
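
As an illustration of independent idempotent elements, the following sketch
works in the algebra of the group of permutations of three things with
rational coefficients. It is an assumption-laden example rather than part of
the original text: the dictionary representation of an element, the product
convention and all function names are chosen here for convenience.

```python
from fractions import Fraction
from itertools import permutations

GROUP = list(permutations(range(3)))  # the six permutations of three things
IDENTITY = (0, 1, 2)


def compose(s, t):
    """Product st of two permutations: apply t first, then s."""
    return tuple(s[t[k]] for k in range(len(t)))


def sign(perm):
    """Return +1 for an even permutation and -1 for an odd one."""
    inversions = sum(
        1
        for a in range(len(perm))
        for b in range(a + 1, len(perm))
        if perm[a] > perm[b]
    )
    return -1 if inversions % 2 else 1


def multiply(a, b):
    """Product in the group algebra: (ab)(u) = sum over st = u of a(s) b(t)."""
    out = {u: Fraction(0) for u in GROUP}
    for s in GROUP:
        for t in GROUP:
            out[compose(s, t)] += a[s] * b[t]
    return out


h = len(GROUP)
zero = {s: Fraction(0) for s in GROUP}
one = {s: Fraction(1 if s == IDENTITY else 0) for s in GROUP}
e_sym = {s: Fraction(1, h) for s in GROUP}         # (1/h) * sum of all s
e_alt = {s: Fraction(sign(s), h) for s in GROUP}   # (1/h) * alternating sum
e_rest = {s: one[s] - e_sym[s] - e_alt[s] for s in GROUP}

for e in (e_sym, e_alt, e_rest):
    print(multiply(e, e) == e)                     # each element is idempotent
print(multiply(e_sym, e_alt) == zero and multiply(e_alt, e_sym) == zero)
```

The three elements add up to the modulus 1 and are mutually independent, so
they give a reduction of 1 in the sense described above; the sketch does not
test whether each summand is primitive.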

Obviously any idempotent element $e$, in particular the
modulus 1 of the algebra, can be reduced into the sum of
independent primitive idempotent elements. For if we have
a reduction into independent non-vanishing idempotent elements

$$e = e_1 + e_2 + \cdots + e_m$$

and if, for example, $e_1$ is not primitive, it can be further reduced
to the sum of two independent non-vanishing idempotent
elements $e_1' + e_1''$; in this way we obtain a complete reduction
of $e$ into $m + 1$ independent terms, for we have, for example,

$$e_1' e_2 = e_1' e_1 e_2 = 0; \quad \text{similarly } e_2 e_1' = 0.$$

This process must certainly cease after at most $h$ steps. Our
analysis allows us to assert that we thus obtain a complete
reduction of $\mathfrak{p}_e$ into independent irreducible sub-spaces.

We have seen that the theorem concerning the complete
reducibility is a consequence of the existence of a generating
unit. But the converse is also true: If $\mathfrak{p}$ appears as a summand
in a complete reduction $\mathfrak{r} = \mathfrak{p} + \mathfrak{p}'$ of our given algebra $\mathfrak{r}$, then
it possesses a generating unit. We need only to specialize the
considerations developed above by applying them to the modulus
1 of $\mathfrak{r}$; 1 can be completely reduced into the two components
$e + e'$ lying in $\mathfrak{p}$ and $\mathfrak{p}'$, and the generating units of $\mathfrak{p}$ and $\mathfrak{p}'$
are $e$ and $e'$ respectively.

The mathematician will find it worthy of note that all these 
considerations are still applicable when the algebra is defined 
over any field whatever. Instead of dealing with the continuum 
of real or complex numbers, as in analysis, we may in abstract 
algebra operate in an arbitrary field, i.e. a domain of elements,
called numbers, in which the two fundamental operations of 
addition and multiplication and their inverses, subtraction and 
division, are defined in accordance with the formal laws of 
ordinary arithmetic. Our development depended only on these 
rules of operation, with a slight restriction. There are fields in
which a definite integer, say $h$, times any number of the field
yields zero; we may say that $h$ annihilates. Such “modular”
fields must be excluded, for we wish to retain the possibility
of finding a number such that its product with h is any given 
number. When our reasoning involves no more restrictive 
assumptions concerning the number field, we are operating in 
a relatively elementary theoretical domain. However, such 
theorems as the “ fundamental theorem ” III, (10.5), and that 
of Burnside-Frobenius-Schur, which depend on the fundamental 
theorem of algebra, belong to a deeper layer. These theorems 
hold only in “ algebraically closed ” number fields, in which 
any algebraic equation (with coefficients in the field) is soluble. 
Finally such concepts as “ Hermitian,” “ unitary,” etc., involve 
the transition from a number to its conjugate complex and 
have no place in general abstract fields. Our earlier proof of 
the theorem of complete reducibility was obtained with the 
aid of such tools foreign to the general concept of a field. 

Theorem (3.5). A similarity projection $x \to x'$ of the invariant
sub-space $\mathfrak{p}$ on the invariant sub-space $\mathfrak{p}'$ is necessarily expressed
by an equation of the form $x' = xb$. (In particular, when $\mathfrak{p}$
and $\mathfrak{p}'$ are equivalent this theorem is applicable to the one-to-one
similarity correspondence $\mathfrak{p} \rightleftarrows \mathfrak{p}'$.)

Proof. Let the given similarity correspondence send the
generating unit $e$ of $\mathfrak{p}$ over into $b$. In virtue of the similarity
$xe$ then goes over into $x' = xb$, where $x$ is any element in $\mathfrak{p}$;
but for such an element $xe = x$.

Additional remark. The projection sends $e = ee$ into $eb$; hence
$eb = b$. On the other hand, if $e'$ is the generating element of
$\mathfrak{p}'$, then since $b$ is in $\mathfrak{p}'$ we have $be' = b$:

$$b = eb = be' = ebe'.$$

We express this result, i.e. that $b$ is of the form $exe'$, by saying
$b$ has the character $(e, e')$. Our considerations show that such
a projection can always be expressed in terms of a unique
element $b$ of character $(e, e')$.

If we are operating in the field of complex numbers, with which the
investigations of analysis (e.g. the theory of functions) deal and in
which we are exclusively interested in quantum theory, we may supplement
the theorem (3.1) concerning the existence of a generating unit $e$
in an invariant sub-space $\mathfrak{p}$ by the following:

The generating unit may be so chosen that it is real; it is then determined
uniquely by $\mathfrak{p}$.

To prove this we choose as the basis $e_1, \ldots, e_g$ of $\mathfrak{p}$ a
unitary-orthogonal system of vectors; then

$$\sum_s \bar{e}_i(s)\, e_k(s) = \delta_{ik} \quad (i, k = 1, 2, \ldots, g).$$

In constructing $d(s, t)$, which we now denote by $e(s, t)$, we may therefore
choose:

$$e(s, t) = \sum_i e_i(s)\, \bar{e}_i(t). \tag{3.6}$$

I assert that the equation

$$e(rs, rt) = e(s, t) \tag{3.7}$$

is automatically satisfied; it is no longer necessary to take its mean
value as in (3.2). The element $e$ defined by $e(t^{-1}s) = e(s, t)$ is then the
real generating unit of $\mathfrak{p}$.

In order to establish the validity of (3.7) it is only necessary to
note that $e(s, t)$ is independent of the particular unitary basis
$e_1, \ldots, e_g$ chosen; for on going over to a new unitary basis $e_1', \ldots, e_g'$
by a unitary transformation $U$ the bilinear form (3.6) remains invariant.
Now in particular the equation

$$e_i'(s) = e_i(rs),$$

in which $r$ is a fixed element of the group, defines a transition to a new
unitary basis.

To prove that this real generating unit $e$ of $\mathfrak{p}$ is unique, assume there
exists a second, $e'$; then all elements $x$ of $\mathfrak{p}$ satisfy the equations

$$xe = x, \quad xe' = x.$$

On applying the first equation for $x = e'$ and the second for $x = e$ we
have

$$e'e = e', \quad ee' = e.$$

But since $e$ and $e'$ are both real, the first of these results yields, on
going over to the Hermitian conjugates,

$$ee' = e',$$

and from this and the previous result we conclude that $e' = e$.

Under these conditions the content of theorem (3.3) can be extended
and its proof simplified. If $e$, $e_1$ are the real generating units of $\mathfrak{p}$, $\mathfrak{p}_1$
respectively, then since $e_1$ is in $\mathfrak{p}$ we have $e_1 e = e_1$, and on going over to the
Hermitian conjugates we find $e e_1 = e_1$. Hence the idempotent element
$e_2$ introduced by $e = e_1 + e_2$ is real and independent of $e_1$; $\mathfrak{p} = \mathfrak{p}_1 + \mathfrak{p}_2$
is thus completely reduced into $\mathfrak{p}_1$ and an invariant sub-space $\mathfrak{p}_2$ which
is unitary-orthogonal to $\mathfrak{p}_1$ and which has as its real generating unit $e_2$.


§ 4. Invariant Sub-spaces in Tensor Space 

We now return to the investigation of tensors of order $f$,
the totality of which constitutes the space $\mathfrak{R}^f$. Let $\pi$ again be
the group of all permutations of $f$ things and $\mathfrak{r}$ ($= \rho$) the corresponding
group space (algebra). Let $a$ be a symmetry quantity,
i.e. an element of the algebra $\rho$, with components $a(s)$; the
element $\hat{a}$ is then defined by

$$\hat{a}(s) = a(s^{-1}). \tag{4.1}$$

The relation

$$F' = aF,$$

which asserts that the tensor $F'$ is obtained from $F$ by the
operator $a$, is equivalent to the equation

$$\mathbf{F}' = \mathbf{F}\hat{a}$$

between the corresponding elements $\mathbf{F}$ and $\mathbf{F}'$ of the algebra $\rho$.
For

$$sF' = \sum_t a(t) \cdot stF$$

is in fact obtained from

$$F' = \sum_t a(t) \cdot tF$$

by operating on it with the permutation $s$.

In the following considerations, which are concerned with
symmetry classes of tensors, $\mathfrak{p}$ (with or without index) always
denotes an invariant sub-space of $\mathfrak{r}$, $e$ the generating unit of
$\mathfrak{p}$ and $\mathfrak{P}$ the corresponding $\sharp\mathfrak{p}$. We may then say that $\hat{e}$ is
the generating idempotent operator of the symmetry class $\mathfrak{P}$ in
the following sense:

(1) $\hat{e}F$ lies in $\mathfrak{P}$, $F$ being any tensor whatever;

(2) if $F$ is in $\mathfrak{P}$ it is reproduced by the operator $\hat{e}$: $\hat{e}F = F$.

In this way we obtain a constructive definition of the symmetry
class $\mathfrak{P}$ as the totality of all tensors of the form $\hat{e}F$. This definition
is considerably simpler than the original one in terms of $\mathfrak{p}$, for
it depends on a single element $e$ instead of a manifold $\mathfrak{p}$. If,
for example, we are dealing with the class $\mathfrak{P}$ of all completely
symmetric tensors

$$\frac{1}{f!} \sum_s s$$

is such an operator; the corresponding operator for the class
of all anti-symmetric tensors is the alternating sum

$$\frac{1}{f!} \sum_s \delta_s \cdot s.$$
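
The two properties of a generating idempotent operator can be tried out on
the symmetrizing operator $\frac{1}{f!}\sum_s s$. The sketch below is only an
illustration under stated assumptions: the function name, the sample tensor
and the convention $sF(i_1 \cdots i_f) = F(i_{s(1)} \cdots i_{s(f)})$ are
not taken from the text.

```python
from fractions import Fraction
from itertools import permutations, product


def symmetrize(tensor, n, f):
    """Apply the operator (1/f!) * sum_s s to a tensor stored as a dict."""
    perms = list(permutations(range(f)))
    result = {}
    for idx in product(range(n), repeat=f):
        total = sum(
            Fraction(tensor[tuple(idx[s[k]] for k in range(f))]) for s in perms
        )
        result[idx] = total / len(perms)
    return result


n, f = 2, 3
# Hypothetical tensor of order 3 without any special symmetry.
F = {idx: idx[0] + 3 * idx[1] + 9 * idx[2] for idx in product(range(n), repeat=f)}

eF = symmetrize(F, n, f)     # property (1): eF is a symmetric tensor
eeF = symmetrize(eF, n, f)   # property (2): e reproduces symmetric tensors
print(eF[(0, 0, 1)] == eF[(0, 1, 0)] == eF[(1, 0, 0)])
print(eeF == eF)
```

Exact rational arithmetic is used so that the comparison of the two tensors
is not disturbed by rounding.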


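Theorem (4.2). If $\mathfrak{p}' \subseteq \mathfrak{p}$ or $\mathfrak{p} = \mathfrak{p}_1 + \mathfrak{p}_2$, we have $\mathfrak{P}' \subseteq \mathfrak{P}$
or $\mathfrak{P} = \mathfrak{P}_1 + \mathfrak{P}_2$ respectively.

We need to prove only the latter part of this theorem,
i.e. for the case of complete reduction. The generating unit
$e = e_1 + e_2$ of $\mathfrak{p}$ has as components $e_1$, $e_2$ in $\mathfrak{p}_1$, $\mathfrak{p}_2$ the generating
units of $\mathfrak{p}_1$, $\mathfrak{p}_2$ respectively. The formula

$$\hat{e}F = \hat{e}_1 F + \hat{e}_2 F$$

defines the corresponding complete reduction of $\mathfrak{P}$ into the
independent invariant sub-spaces $\mathfrak{P}_1$, $\mathfrak{P}_2$.

Theorem (4.3). If $\mathfrak{p}_1 \sim \mathfrak{p}_2$ then $\mathfrak{P}_1 \sim \mathfrak{P}_2$.

The similarity correspondence $x_1 \rightleftarrows x_2$ of $\mathfrak{p}_1$ on $\mathfrak{p}_2$ is, by
theorem (3.5), of the form

$$x_2 = x_1 b, \quad x_1 = x_2 b'.$$

Hence

$$F_2 = \hat{b} F_1, \quad F_1 = \hat{b}' F_2$$

define a one-to-one similar correspondence of $\mathfrak{P}_1$ on $\mathfrak{P}_2$ and its
inverse.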

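Theorem (4.4). If $\mathfrak{p} \subseteq \mathfrak{r}_0$ then $\mathfrak{p} = \natural\sharp\mathfrak{p}$.

The only non-trivial part of this first converse theorem which
remains to be proved is that $\mathfrak{p} \subseteq \natural\mathfrak{P}$. All tensors of the form
$F_\alpha = \hat{e}E_\alpha$ are in $\mathfrak{P}$, where $(E_\alpha)$ is a basis for the entire tensor
space $\mathfrak{R}^f$; hence all elements of the form

$$y = \sum_{\alpha, i} c_\alpha(i_1 \cdots i_f) \cdot \mathbf{F}_\alpha(i_1 \cdots i_f)$$

are in $\natural\mathfrak{P}$. On introducing

$$x = \sum_{\alpha, i} c_\alpha(i_1 \cdots i_f) \cdot \mathbf{E}_\alpha(i_1 \cdots i_f)$$

we have $y = xe$. On recalling the definition of $\mathfrak{r}_0 = \natural\mathfrak{R}^f$ we
see that $xe$ belongs to $\natural\mathfrak{P}$ if $x$ lies in $\mathfrak{r}_0$. But in virtue of the
assumption that $\mathfrak{p} \subseteq \mathfrak{r}_0$ this is automatically satisfied if $x$ is an
arbitrary element of $\mathfrak{p}$; but then $xe = x$. Hence every element
of $\mathfrak{p}$ is contained in $\natural\mathfrak{P}$.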

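In order to formulate the converse of these theorems let
$\mathfrak{P}$ (with or without index) now denote an arbitrary invariant
sub-space of $\mathfrak{R}^f$ and $\mathfrak{p}$ the corresponding $\natural\mathfrak{P}$.

Theorem (4.5). If $\mathfrak{P}' \subseteq \mathfrak{P}$ or $\mathfrak{P} = \mathfrak{P}_1 + \mathfrak{P}_2$, then $\mathfrak{p}' \subseteq \mathfrak{p}$,
$\mathfrak{p} = \mathfrak{p}_1 + \mathfrak{p}_2$, respectively.

Theorem (4.6). If $\mathfrak{P} \sim \mathfrak{P}'$ then $\mathfrak{p} \sim \mathfrak{p}'$.

Theorem (4.7). $\mathfrak{P} = \sharp\natural\mathfrak{P}$.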

The last theorem is by far the most important of all; it
asserts that every $\mathfrak{P}$ is a symmetry class of tensors. It is desirable
to prove it first, i.e. to prove that $\sharp\mathfrak{p} \subseteq \mathfrak{P}$. Let $e$ again denote
the generating unit of $\mathfrak{p}$; then $\sharp\mathfrak{p}$ consists of all tensors of the
form $F' = \hat{e}F$. Since the element $e$ belongs to $\mathfrak{p}$ it is necessarily
of the form

$$\hat{e}(s) = e(s^{-1}) = \sum_{\alpha, k} e_\alpha(k_1 \cdots k_f) \cdot s^{-1}E_\alpha(k_1 \cdots k_f), \tag{4.8}$$

where the tensors $E_\alpha$ constitute a basis for the space $\mathfrak{P}$. Now
the trivial equation

$$\sum_i sc(i_1 \cdots i_f) \cdot sF(i_1 \cdots i_f) = \sum_i c(i_1 \cdots i_f) \cdot F(i_1 \cdots i_f)$$

shows, on replacing $sc$ by $c$, that

$$\sum_i c(i_1 \cdots i_f) \cdot sF(i_1 \cdots i_f) = \sum_i s^{-1}c(i_1 \cdots i_f) \cdot F(i_1 \cdots i_f).$$

Hence we may replace (4.8) by

$$\hat{e}(s) = \sum_{\alpha, k} se_\alpha(k_1 \cdots k_f) \cdot E_\alpha(k_1 \cdots k_f)$$

and the coefficients of $F'$ are then given by

$$F'(i_1 \cdots i_f) = \sum_{\alpha, k} c_\alpha(i_1 \cdots i_f;\, k_1 \cdots k_f)\, E_\alpha(k_1 \cdots k_f)$$

where

$$c_\alpha(i_1 \cdots i_f;\, k_1 \cdots k_f) = \sum_s sF(i_1 \cdots i_f) \cdot se_\alpha(k_1 \cdots k_f).$$

Because of the summation over all elements $s$ of the group $\pi$
this transformation with coefficients $c_\alpha$ is symmetric; hence
the assumption that the sub-space $\mathfrak{P}$ is invariant allows us to
conclude that $F'$ lies in $\mathfrak{P}$ if the $E_\alpha$ do. But this establishes
our theorem.

The theorem can also be proved directly, without calling
on the theorems of § 3, in the following way. That $F$ is in $\sharp\mathfrak{p}$
means that $\mathbf{F}(i_1 i_2 \cdots i_f)$ is in $\mathfrak{p}$ and is consequently of the
form (2.6):

$$\mathbf{F}(i_1 \cdots i_f) = \sum_{\alpha, k} b_\alpha(i_1 \cdots i_f;\, k_1 \cdots k_f) \cdot \mathbf{E}_\alpha(k_1 \cdots k_f).$$

The $E_\alpha$ constitute a basis of $\mathfrak{P}$. Writing down the $s$-component
of this equation and replacing the indices $i_{1'}, \cdots, i_{f'}$ by $i_1, \cdots, i_f$
we find the equation

$$F(i_1 \cdots i_f) = \sum_{\alpha, k} s^{-1}b_\alpha(i_1 \cdots i_f;\, k_1 \cdots k_f) \cdot E_\alpha(k_1 \cdots k_f)$$

for the components of $F$. Since this holds for every permutation $s$
we may sum over the elements of the group and obtain

$$F(i_1 \cdots i_f) = \sum_{\alpha, k} c_\alpha(i_1 \cdots i_f;\, k_1 \cdots k_f) \cdot E_\alpha(k_1 \cdots k_f),$$

where the coefficients

$$c_\alpha(i_1 \cdots i_f;\, k_1 \cdots k_f) = \frac{1}{f!} \sum_s s\,b_\alpha(i_1 \cdots i_f;\, k_1 \cdots k_f)$$

are symmetric. Hence since the $E_\alpha$ belong to the invariant
sub-space $\mathfrak{P}$ and $F$ is obtained from them by a symmetric
transformation, $F$ also belongs to $\mathfrak{P}$.

The only part of theorem (4.5) which is not self-evident is
the assertion that $\mathfrak{p}_1$, $\mathfrak{p}_2$ are independent. By theorem (4.7) we
have the relations

$$\sharp\mathfrak{p}^* \subseteq \mathfrak{P}_1, \quad \sharp\mathfrak{p}^* \subseteq \mathfrak{P}_2$$

for the (invariant) intersection $\mathfrak{p}^*$ of $\mathfrak{p}_1$ and $\mathfrak{p}_2$. But since
$\mathfrak{P}_1$, $\mathfrak{P}_2$ are independent it follows that $\sharp\mathfrak{p}^*$, and therefore $\mathfrak{p}^*$,
is empty.

Theorem (4.5) shows the $\mathfrak{P}$ associated with an irreducible $\mathfrak{p}$
is also irreducible. Hence it follows, in particular, that the
manifold of symmetric and the manifold of anti-symmetric tensors
are irreducible and invariant, not only with respect to the algebra
of symmetric transformations, but also with respect to the
transformations induced in tensor space by the affine or unitary
groups of transformations in the vector space $\mathfrak{R}$. Applying
this to the 2-dimensional vector space, we see that the representations
of $\mathfrak{c} = \mathfrak{c}_2$ or $\mathfrak{u}$ constructed in III, § 5, are irreducible.

In order to prove (4.6) we must first examine the nature of
$\mathfrak{r}_0$ (for $n < f$) in some detail. We call the component $a(1)$ of
an element $a$ of the algebra the trace of $a$. Hence the trace
of the product $ab$, which we call the scalar product $\mathrm{tr}(ab)$
of $a$ and $b$, is

$$\mathrm{tr}(ab) = \sum_s a(s)\, b(s^{-1}).$$

The trace of $a$ is then $\mathrm{tr}(a1) = \mathrm{tr}(1a) = \mathrm{tr}(a)$. The scalar
product is obviously symmetric in $a$ and $b$, and the symmetric
bilinear form $\mathrm{tr}(ab)$ is non-degenerate, i.e. $a = 0$ is the only
element for which the equation $\mathrm{tr}(ax) = 0$ is satisfied identically
in $x$.
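
The properties of the scalar product just stated can be verified on a small
example. The following sketch uses the permutations of three things with
integer components; the representation of elements as dictionaries, the
sample elements and the helper names are assumptions made for this
illustration.

```python
from itertools import permutations

GROUP = list(permutations(range(3)))
IDENTITY = (0, 1, 2)


def compose(s, t):
    """Product st of two permutations: apply t first, then s."""
    return tuple(s[t[k]] for k in range(len(t)))


def inverse(s):
    """Inverse permutation of s."""
    inv = [0] * len(s)
    for k, image in enumerate(s):
        inv[image] = k
    return tuple(inv)


def multiply(a, b):
    """Product in the group algebra: (ab)(u) = sum over st = u of a(s) b(t)."""
    out = {u: 0 for u in GROUP}
    for s in GROUP:
        for t in GROUP:
            out[compose(s, t)] += a[s] * b[t]
    return out


def trace_form(a, b):
    """Scalar product tr(ab) = sum_s a(s) b(s^-1)."""
    return sum(a[s] * b[inverse(s)] for s in GROUP)


def unit(t):
    """The simple element of the algebra belonging to the group element t."""
    return {s: 1 if s == t else 0 for s in GROUP}


# Two arbitrary sample elements of the algebra.
a = {s: k + 1 for k, s in enumerate(GROUP)}
b = {s: (k * k) % 5 - 2 for k, s in enumerate(GROUP)}

print(trace_form(a, b) == multiply(a, b)[IDENTITY])  # tr(ab) is the component (ab)(1)
print(trace_form(a, b) == trace_form(b, a))          # symmetry in a and b
# tr(a * t^-1) picks out the component a(t), hence tr(ax) = 0 for all x forces a = 0.
print(all(trace_form(a, unit(inverse(t))) == a[t] for t in GROUP))
```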

Auxiliary theorem (4.9). $\mathfrak{r}_0$ is a left- as well as right-invariant
sub-algebra of $\mathfrak{r}$. $\mathrm{tr}(ab)$ is non-degenerate within $\mathfrak{r}_0$, i.e. the only
element $a$ of $\mathfrak{r}_0$ whose scalar product with every element $x$ of $\mathfrak{r}_0$
vanishes is $a = 0$.

The first part of this theorem is almost self-evident. For
if $x = \mathbf{F}(i_1 \cdots i_f)$, the element $x'$ defined by $x'(s) = x(sr)$ is
$\mathbf{F}'(i_1 \cdots i_f)$ where $F' = rF$.

Let $i$ be the generating unit of $\mathfrak{r}_0$, $a$ an element of $\mathfrak{r}_0$ and
$x$ an arbitrary element. Then since $\mathfrak{r}_0$ is right-invariant $ax$
is also in $\mathfrak{r}_0$, whence

$$ax = ax \cdot i, \quad \mathrm{tr}(ax) = \mathrm{tr}(a \cdot xi).$$

Now $xi$ is in $\mathfrak{r}_0$; hence if the scalar product of $a$ with every
element $xi$ of $\mathfrak{r}_0$ vanishes then $\mathrm{tr}(ax) = 0$ without restriction on
$x$. It therefore follows that $a = 0$, as asserted.

Proof of theorem (4.6). Let $E_\alpha$ be a basis for $\mathfrak{P}$ and let the
similarity correspondence of $\mathfrak{P}$ on $\mathfrak{P}'$ send $E_\alpha$ into the basis $E_\alpha'$
for $\mathfrak{P}'$. Let $c_\alpha(i_1 \cdots i_f)$ be a given system of coefficients and
write

$$c = \sum_{\alpha, i} c_\alpha(i_1 \cdots i_f) \cdot \mathbf{E}_\alpha(i_1 \cdots i_f), \tag{4.10}$$

$$c' = \sum_{\alpha, i} c_\alpha(i_1 \cdots i_f) \cdot \mathbf{E}_\alpha'(i_1 \cdots i_f).$$

The desired similarity correspondence between $\mathfrak{p}$ and $\mathfrak{p}'$ is naturally
to be defined by $c \to c'$. However, this is only possible provided
two systems of coefficients $c_\alpha(i_1 \cdots i_f)$ which define the same
$c$ also define the same $c'$; or a system of coefficients which
causes $c$ to vanish must also cause $c'$ to vanish.

We first remark that if a tensor $F$ satisfies the equation

$$G = \sum_s c(s^{-1}) \cdot sF = 0$$

then also

$$G' = \sum_s c'(s^{-1}) \cdot sF = 0.$$

By (4.10)

$$c(s^{-1}) = \sum_{\alpha, k} sc_\alpha(k_1 \cdots k_f) \cdot E_\alpha(k_1 \cdots k_f),$$

whence

$$G(i_1 \cdots i_f) = \sum_{\alpha, k} C_\alpha(i_1 \cdots i_f;\, k_1 \cdots k_f)\, E_\alpha(k_1 \cdots k_f)$$

where

$$C_\alpha(i_1 \cdots i_f;\, k_1 \cdots k_f) = \sum_s sF(i_1 \cdots i_f) \cdot sc_\alpha(k_1 \cdots k_f).$$

These define a symmetric transformation. Hence the given
similarity transformation $\mathfrak{P} \to \mathfrak{P}'$, which sends $E_\alpha$ into $E_\alpha'$, sends
$G$ into $G'$. This proves our assertion that the vanishing of
$G$ implies the vanishing of $G'$.

If $c = 0$ we then have

$$\sum_s c'(s^{-1}) \cdot sF(i_1 \cdots i_f) = \mathrm{tr}[c' \cdot \mathbf{F}(i_1 \cdots i_f)] = 0$$

for all tensors $F$ and all combinations of indices $i_1 \cdots i_f$, or
$\mathrm{tr}(c'x) = 0$ for all elements $x$ of $\mathfrak{r}_0$. Hence by the auxiliary
theorem (4.9) $c' = 0$.

The result of our investigations is that there exists a one-to-one
correspondence between the invariant sub-spaces $\mathfrak{p}$ of $\mathfrak{r}_0$ and the
invariant sub-spaces $\mathfrak{P}$ of $\mathfrak{R}^f$. This correspondence is as close
as possible; irreducibility, complete reduction, equivalence and
inequivalence on the one hand imply the same on the other. In
particular, we emphasize the further consequence:

Theorem (4.11). Every invariant sub-space $\mathfrak{P}$ of $\mathfrak{R}^f$, in
particular $\mathfrak{R}^f$ itself, can be completely reduced into irreducible
invariant sub-spaces.

I hope that our elementary methods have made this correspondence
quite apparent.

It is evident a priori that we can completely reduce the
modulus 1 of the algebra $\rho$ into a sum $e_1 + e_2 + \cdots + e_m$ of
independent primitive idempotent elements. The formula

$$F = \hat{e}_1 F + \hat{e}_2 F + \cdots + \hat{e}_m F$$

then gives the complete reduction of $\mathfrak{R}^f$ into independent invariant
sub-spaces $\mathfrak{P}_1, \mathfrak{P}_2, \cdots, \mathfrak{P}_m$, each of which is generated
by one of the idempotent operators $\hat{e}$. ($\mathfrak{P}_1$ consists of all tensors
of the form $\hat{e}_1 F$.) From this point of view we might consider
as the only non-trivial result of our investigation the assertion
that the $\mathfrak{P}$ generated by a primitive $e$ is irreducible (with respect
to the algebra $\Sigma$ of all symmetric transformations). Physically
this means that the class of terms corresponding to such a $\mathfrak{P}$
cannot be further divided into parts which cannot under any
conditions interact with each other. If in spite of this there
does exist such a decomposition it is accidental, i.e. attributable
to the special dynamical situation in the case in question.
§ 5. Fields and Algebras

We here interrupt our development in order to present an 
axiomatic treatment of the two fundamental concepts field and 
algebra ; our investigation has revealed the importance of these 
concepts for quantum theory. The physicist who is not par- 
ticularly interested in such a treatment may well omit these 
sections. 

A field is a domain of elements, called numbers, within
which the two operations of addition and multiplication are
defined and which associate with any two numbers $\alpha$, $\beta$ of the
field certain unique numbers $\alpha + \beta$, $\alpha\beta$ respectively. Addition
obeys the commutative and associative laws

$$\alpha + \beta = \beta + \alpha, \quad (\alpha + \beta) + \gamma = \alpha + (\beta + \gamma)$$

and has a unique inverse, subtraction. From this follows the
existence of a unique number $o$ (zero) with the property
$\alpha + o = o + \alpha = \alpha$ for all $\alpha$. Further, associated with each
number $\alpha$ is a number $-\alpha$, its negative, such that $\alpha + (-\alpha) = o$.
We require that multiplication obey the associative law

$$(\alpha\beta)\gamma = \alpha(\beta\gamma)$$

and the distributive laws

$$(\alpha + \beta)\gamma = (\alpha\gamma) + (\beta\gamma), \quad \alpha(\beta + \gamma) = (\alpha\beta) + (\alpha\gamma)$$

with respect to addition. From the distributive law follow
the relations

$$\alpha o = o\alpha = o.$$

Multiplication need not be commutative; in case it is we speak
of a commutative field. Further, division by any number
other than $o$ shall be possible and shall lead to a unique quotient,
i.e. each of the equations

$$\alpha\xi = \beta, \quad \eta\alpha = \beta$$

have for given $\alpha \neq o$ and given $\beta$ one and only one solution
$\xi$, $\eta$ respectively. From this it follows that the product $\alpha\beta$ of
two numbers can only be $o$ if one of the two factors is $o$. As a
further consequence, there exists a number $\varepsilon$, “one” or “unity,”
with the property that

$$\alpha\varepsilon = \varepsilon\alpha = \alpha$$

for all $\alpha$. We explicitly assume that not all numbers equal $o$;
then in particular $\varepsilon \neq o$. Every number $\alpha \neq o$ possesses a
unique reciprocal $\alpha^{-1}$ with the property $\alpha\alpha^{-1} = \alpha^{-1}\alpha = \varepsilon$.
We must introduce in addition to the numbers of our field
the ordinary numerical symbols 1, 2, 3, $\cdots$. Their interpretation
as multipliers is given by the equations

$$1\alpha = \alpha, \quad 2\alpha = \alpha + \alpha, \quad 3\alpha = (2\alpha) + \alpha, \quad \cdots,$$

in general

$$(n + 1)\alpha = (n\alpha) + \alpha.$$

In particular we can construct the series

$$1\varepsilon, \ 2\varepsilon, \ \cdots, \ n\varepsilon, \ \cdots \tag{5.1}$$

of multiples of $\varepsilon$. We then have two possibilities. (1) All the
numbers of this set may differ from $o$; then they are all different,
and we can conclude with the aid of the equation

$$n\beta = n\varepsilon \cdot \beta$$

and the division axiom that for a given number $\alpha$ there exists
one and only one number $\beta = \alpha/n$ which satisfies the equation
$n\beta = \alpha$; we can then introduce ordinary rational numbers as
multipliers. (2) The second possibility is that one of the multiples
in (5.1) is equal to $o$; let the least multiple of this kind be
$p\varepsilon$. Then the numbers of the series (5.1) repeat in cycles of
length $p$. $p$ must be a prime number, for if $p$ were the product
of two integers $m$, $n$ smaller than $p$ we would then have

$$o = p\varepsilon = m\varepsilon \cdot n\varepsilon,$$

but by assumption neither $m\varepsilon$ nor $n\varepsilon$ is $o$, for $p\varepsilon$ is the lowest
multiple of this kind, and this is contrary to the division axiom.
In this case we are dealing with a finite field of modulus $p$.
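
The argument that the modulus must be a prime number can be made concrete
with the residues of the integers modulo $m$. This is a sketch under the
assumption that such residues serve as the example domain; the two function
names are hypothetical.

```python
def least_annihilating_multiple(m):
    """Smallest p >= 1 for which p times the unit vanishes modulo m."""
    total = 0
    for p in range(1, m + 1):
        total = (total + 1) % m  # add the unit once more
        if total == 0:
            return p


def zero_divisors(m):
    """Pairs (a, b), a <= b, of non-zero residues modulo m with a*b = 0."""
    return [(a, b) for a in range(1, m) for b in range(a, m) if (a * b) % m == 0]


print(least_annihilating_multiple(7), zero_divisors(7))
print(least_annihilating_multiple(6), zero_divisors(6))
```

For a prime modulus no product of non-zero residues vanishes, in agreement
with the division axiom; for the composite modulus 6 the pair (2, 3) violates
it, so the residues modulo 6 do not form a field.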

In order not to lose ourselves in too broad generalities we
now take as our number domain a commutative field and define
a linear associative algebra of finite order over this field.
By number we mean the elements of the field, and denote its zero $o$
and its unit $\varepsilon$ by 0 and 1; by element we mean an element of the
algebra. We denote the former by small Greek and the latter by
small Latin letters. An algebra is characterized by three fundamental
operations: addition of two elements, $a + b$; multiplication of
an element by a number, $\gamma a$; multiplication of two elements, $ab$.
The first and second of these operations obey the familiar axioms
of vector calculus (I, § 1), which we set forth here again for the
sake of completeness.

Addition is commutative and associative and has a unique
inverse, subtraction. It then follows that there exists a null-element
0. Multiplication by a number obeys the laws

$$1a = a, \quad \alpha(\beta c) = (\alpha\beta)c,$$

$$(\alpha + \beta)c = (\alpha c) + (\beta c), \quad \alpha(b + c) = (\alpha b) + (\alpha c).$$

The order $h$ is introduced by the dimensionality axiom: every
$h + 1$ elements of the algebra are linearly dependent, the coefficients
in the equations expressing the dependence being
numbers of the field, but there exist $h$ linearly independent
elements. A set of $h$ such elements $e_1, e_2, \cdots, e_h$, called “basal
units,” form a basis for the algebra in the sense that any element
$a$ can be expressed in one and only one way in the form

$$a = \alpha_1 e_1 + \alpha_2 e_2 + \cdots + \alpha_h e_h$$

and can be replaced by the set $(\alpha_1, \alpha_2, \cdots, \alpha_h)$ of $h$ numerical
components.

Multiplication of elements among themselves obeys the
distributive laws

$$(a + b)c = (ac) + (bc), \quad c(a + b) = (ca) + (cb)$$

for both factors and the associative laws

$$\gamma a \cdot b = \gamma(ab), \quad b \cdot \gamma a = \gamma(ba),$$

$$(ab)c = a(bc).$$

We neither assume that multiplication is commutative nor
that it possesses a unique inverse, division. But we do assume
that the algebra possesses a “one,” the modulus (or principal
unit), i.e. an element $e$ with the property $ae = ea = a$ for all
elements $a$. We shall usually not hesitate to denote the zero
and one of the elements of the algebra by 0 and 1.

If we assume the possibility of division the algebra reduces
to a (in general non-commutative) field or division algebra of
finite order $h$ over the given field.

§ 6. Representations of Algebras 

For the sake of the printer and in order to give the text a
more peaceful appearance we no longer emphasize the elements
of our algebra by expressing them in boldface type. This
applies in particular to the elements of the algebra $\rho$ of “symmetry
quantities,” which we may often denote by this latter
expression in case of possible confusion with the elements of
the underlying group. We still employ this means of distinguishing
between the tensor $F$ and the symmetry element $\mathbf{F}$ or when
we wish to consider an element as an operator acting on a tensor.
We start with an algebra $\rho$ of finite order $h$, the elements of
which constitute an $h$-dimensional vector space $\mathfrak{r}$, and associate
with the element $a$ of $\rho$ the correspondence

$$(a): \quad x \to x' = ax$$

of $\mathfrak{r}$ on itself. We consider the algebra $(\rho)$ of transformations
$(a)$, which is simply isomorphic with the algebra $\rho$, as fundamental
for the vector space $\mathfrak{r}$, i.e. the terms reducible, invariance,
etc., as applied to sub-spaces of $\mathfrak{r}$ are with respect to the
group of transformations $(a)$. We assume that $\mathfrak{r}$ can be completely
reduced into irreducible sub-spaces $\mathfrak{p}_1 + \mathfrak{p}_2 + \cdots$; each of
these sub-spaces then contains an idempotent generating unit
$e_1, e_2, \cdots$. We have already seen that this assumption is true
for the algebra associated with any finite group, at least under
the restriction that the field over which the algebra is defined
does not have as modulus a prime number which is a factor of
the order $h$ of the group.
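
That the transformations $(a)$ multiply in the same way as the elements $a$
themselves can be seen in a small case. The sketch below builds the matrix of
$x \to ax$ for the algebra of the permutations of three things; the basis
ordering, the product convention and the helper names are assumptions of this
illustration and not notation of the text.

```python
from itertools import permutations

GROUP = list(permutations(range(3)))  # basis of the 6-dimensional space


def compose(s, t):
    """Product st of two permutations: apply t first, then s."""
    return tuple(s[t[k]] for k in range(len(t)))


def inverse(s):
    """Inverse permutation of s."""
    inv = [0] * len(s)
    for k, image in enumerate(s):
        inv[image] = k
    return tuple(inv)


def multiply(a, b):
    """Product in the algebra: (ab)(u) = sum over st = u of a(s) b(t)."""
    out = {u: 0 for u in GROUP}
    for s in GROUP:
        for t in GROUP:
            out[compose(s, t)] += a[s] * b[t]
    return out


def left_matrix(a):
    """Matrix of (a): x -> ax; the entry in row u, column t is a(u t^-1)."""
    return [[a[compose(u, inverse(t))] for t in GROUP] for u in GROUP]


def matmul(A, B):
    """Ordinary matrix product of two square matrices given as lists."""
    size = len(A)
    return [
        [sum(A[i][k] * B[k][j] for k in range(size)) for j in range(size)]
        for i in range(size)
    ]


# Two arbitrary sample elements of the algebra.
a = {s: k + 1 for k, s in enumerate(GROUP)}
b = {s: 2 * k - 3 for k, s in enumerate(GROUP)}

print(matmul(left_matrix(a), left_matrix(b)) == left_matrix(multiply(a, b)))
```

The printed comparison expresses that the correspondence $a \to (a)$ carries
products into products, which is what is meant by calling $(\rho)$ simply
isomorphic with $\rho$.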

We discussed the representations of a group or of the corre- 
sponding algebra in Chapter III. We found that the irreducible 
representations are subject to certain important conditions 
which, surprisingly enough, limit their number and which, 
together with the as yet unproved “ completeness theorem,” 
lead to the reduction of the given algebra into independent 
simple matric algebras (III, § 13). That we were unable to 
prove the completeness theorem with the methods there em- 
ployed was to be expected, for we assumed that the representa- 
tions were given and examined their properties ; we had no 
general process for the construction of representations of the 
given algebra. But we are now in possession of the materials 
for such a construction: the reduction of $\mathfrak{r}$ into irreducible
sub-spaces reduces the regular representation into as many
inequivalent irreducible representations of our algebra as there
are inequivalent invariant sub-spaces $\mathfrak{p}_i$. We shall now carry
out this construction process to the point of obtaining the re- 
duction of our algebra into independent simple matric algebras ; 
it will be desirable to derive the previous results again from this 
standpoint. A further difference between this investigation 
and that of Chapter III consists in the fact that we here refrain 
as long as possible from placing restrictive assumptions on the 
commutative field over which the algebra is defined ; only at 
the end of the investigation do we discuss the advantages at- 
tributable to the fact that the continuum of complex numbers, 
the only field in which we are interested for the physical appli- 
cations, is algebraically closed. 
Theorem (6.1). Every representation of the algebra $\rho$ is completely
reducible into irreducible representations. Each of these
irreducible constituents is equivalent to the representation induced
in some $\mathfrak{p}_i$ by the regular representation.

(Hence the complete reducibility of the given algebra implies 
the complete reducibility of its representations. Further, every 
irreducible representation is contained in the regular repre- 
sentation, which therefore constitutes an appropriate starting 
point for obtaining all representations by the method of reduction). 

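Let $\mathfrak{h}$ be an $n$-dimensional representation, and let
$\mathfrak{e}_1, \mathfrak{e}_2, \ldots, \mathfrak{e}_n$ be $n$ fundamental
vectors constituting a co-ordinate system in the representation space
$\mathfrak{R}$ of $\mathfrak{h}$. If the element $a$ of the algebra
corresponds to the linear correspondence $A$ in $\mathfrak{h}$, we
interpret the equation

$$ \mathfrak{x}' = a\mathfrak{x} \quad\text{as}\quad \mathfrak{x}' = A\mathfrak{x}, $$

where $\mathfrak{x}'$, $\mathfrak{x}$ are vectors in $\mathfrak{R}$. If
$\mathfrak{e}$ is a given fixed vector and $x$ runs through all elements
of one of the irreducible invariant sub-spaces
$\mathfrak{p} = \mathfrak{p}_i$ of $\mathfrak{r}$ then, as we shall show
immediately, $x\mathfrak{e}$ runs through a certain sub-space
$\mathfrak{p}(\mathfrak{e})$ of $\mathfrak{R}$ which is invariant with
respect to $\mathfrak{h}$. Indeed, the transformation $A$ associated with
an arbitrary element $a$ sends $x\mathfrak{e}$ over into
$(ax)\mathfrak{e}$, and if $x$ is in $\mathfrak{p}$, $ax$ is also.
$\mathfrak{p}(\mathfrak{e})$ is either 0 or is similar to $\mathfrak{p}$
in the sense that different $x$ generate different images
$x\mathfrak{e}$, for those $x$ of $\mathfrak{p}$ for which
$x\mathfrak{e} = 0$ constitute an invariant sub-space $\mathfrak{p}'$ of
$\mathfrak{p}$, and in virtue of the assumption that $\mathfrak{p}$ was
irreducible $\mathfrak{p}'$ must either be 0 or $\mathfrak{p}$ itself.
Hence if $\mathfrak{p}(\mathfrak{e}) \neq 0$ the representation induced
in $\mathfrak{p}(\mathfrak{e})$ by $\mathfrak{h}$ is equivalent to the
regular representation in $\mathfrak{p}$.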

These considerations are to be supplemented by the following
remark. If $\mathfrak{P}$ is any invariant sub-space of $\mathfrak{R}$
then $\mathfrak{p}(\mathfrak{e})$ is either independent of $\mathfrak{P}$
or is contained entirely in $\mathfrak{P}$, for those elements $x$ of
$\mathfrak{p}$ for which $x\mathfrak{e}$ lies in $\mathfrak{P}$
constitute an invariant sub-space of $\mathfrak{p}$, which is therefore
necessarily either 0 or $\mathfrak{p}$ itself.

Now construct successively

$$ \begin{array}{l}
\mathfrak{p}_1(\mathfrak{e}_1),\ \mathfrak{p}_2(\mathfrak{e}_1),\ \ldots, \\
\mathfrak{p}_1(\mathfrak{e}_2),\ \mathfrak{p}_2(\mathfrak{e}_2),\ \ldots, \\
\quad\cdots \\
\mathfrak{p}_1(\mathfrak{e}_n),\ \mathfrak{p}_2(\mathfrak{e}_n),\ \ldots
\end{array} $$

Each sub-space in this list is either entirely contained in the
sum of the previous ones or is independent of this sum; on
retaining only those sub-spaces for which this latter possibility
is realized we obtain a reduction of $\mathfrak{R}$ into certain invariant
sub-spaces $\mathfrak{p}_i(\mathfrak{e}_k)$. To prove this theorem we need
only to note that the sum of the sub-spaces contained in the first row
contains at least the vector $\mathfrak{e}_1$, that on adding to them the
sum of those contained in the second row we obtain at least the vector
$\mathfrak{e}_2$ in addition, etc.

The theorem just proved is in particular applicable to the
symmetric group $\pi$, and we now wish to establish the analogue
for the algebra $\Sigma$ of symmetric transformations in the space
$\mathfrak{R}^f$ of tensors of order $f$. We already know that
$\mathfrak{R}^f$ can be reduced into sub-spaces $\mathfrak{P}_i$ which are
irreducible with respect to $\Sigma$ (provided the number field over which
$\Sigma$ is defined does not have as modulus a prime $\leq f$). Every
transformation $A$ of $\Sigma$ is at the same time a transformation $A_i$
of $\mathfrak{P}_i$ on itself, and the correspondence $A \to A_i$ is
naturally a representation of $\Sigma$, the “representation induced in
$\mathfrak{P}_i$ by the algebra $\Sigma$.” We wish to show that the
representations of $\Sigma$ are completely reducible into irreducible
constituents, and that each of these constituents is equivalent to the
representation induced in some $\mathfrak{P}_i$ by the algebra $\Sigma$.
Naturally this does not follow immediately from Theorem (6.1); in order
to establish the connection between the two we must show that the
complete reducibility of $\mathfrak{R}^f$ into irreducible invariant
sub-spaces implies the same for the algebra $\Sigma$. We apply the
notation and conventions given at the beginning of this section to the
algebra $\Sigma$: $(A)$ is the correspondence

$$ S \to S' = AS $$

of the “vector space” $\Sigma$ on itself, $A \to (A)$ the regular
representation of $\Sigma$; the algebra of transformations $(A)$, which is
simply isomorphic with $\Sigma$, is taken as fundamental in the vector
space $\Sigma$, i.e. the transformation group of $\Sigma$ consists of the
transformations $(A)$.

Theorem (6.2). Let $\Sigma$ be an algebra of transformations in a
vector space $\mathfrak{R}$, and let $\mathfrak{R}$ be completely reducible
with respect to this system $\Sigma$ of transformations into irreducible
invariant sub-spaces $\mathfrak{P}_i$. Then $\Sigma$ is itself completely
reducible into irreducible invariant sub-spaces $\Pi_i$, and the
representation induced by the regular representation in $\Pi_i$ coincides
with (more precisely, is equivalent to) the representation induced in one
of the irreducible $\mathfrak{P}_k$ by the algebra $\Sigma$ itself.

This theorem holds without any restrictions on the field
over which $\Sigma$ is defined. Let $\Pi$ be an irreducible invariant
sub-space of $\Sigma$ (consisting not merely of the transformation 0),
and let $R \neq 0$ be a transformation of $\Pi$. There then exists
a vector $\mathfrak{a}$ in $\mathfrak{R}$ such that $R\mathfrak{a} \neq 0$.
Let $\mathfrak{a}$ be decomposed into its components $\mathfrak{a}_i$ in
the various sub-spaces $\mathfrak{P}_i$; at least one of these components,
say $\mathfrak{a}_i = \mathfrak{e}$, must be carried over into a vector
$R\mathfrak{e} \neq 0$ by $R$. We now hold $\mathfrak{e}$ fixed and let $S$
in $\mathfrak{x} = S\mathfrak{e}$ run through all transformations of $\Pi$;
these $\mathfrak{x}$ then constitute an invariant sub-space
$\Pi(\mathfrak{e})$ of $\mathfrak{P} = \mathfrak{P}_i$. The “typical
reasoning” already applied in the proof of the previous theorem then
allows us to conclude that:

(1) $\Pi(\mathfrak{e})$ is either 0 or $\mathfrak{P}$, as $\mathfrak{P}$ is
irreducible; in this case it is necessarily $\mathfrak{P}$, for the vector
$R\mathfrak{e} \neq 0$ belongs to $\Pi(\mathfrak{e})$.

(2) $S = 0$ is the only transformation in $\Pi$ which sends $\mathfrak{e}$
over into 0, for those $S$ of $\Pi$ for which $S\mathfrak{e} = 0$
constitute an invariant sub-space of the irreducible sub-space $\Pi$.
Hence $\mathfrak{x} = S\mathfrak{e}$ sets up a one-to-one correspondence
between $\Pi$ and $\mathfrak{P}$.

This correspondence is similar, for $S' = AS$ implies that
the vectors $\mathfrak{x} = S\mathfrak{e}$, $\mathfrak{x}' = S'\mathfrak{e}$
satisfy the equation $\mathfrak{x}' = A\mathfrak{x}$. We have thus proved
the second part of our theorem: the representation induced in $\Pi$ by
the regular representation coincides with the representation induced in
$\mathfrak{P}$ by the algebra itself; briefly, $\Pi$ is similar to some
$\mathfrak{P}_i$.

Since $S\mathfrak{e}$ runs through the entire sub-space $\mathfrak{P}$
when $S$ runs through $\Pi$, there exists an $E$ in $\Pi$ such that
$E\mathfrak{e} = \mathfrak{e}$; then $E^2\mathfrak{e} = \mathfrak{e}$.
Since the transformations $E$ and $E^2$ of $\Pi$ both associate the same
image with $\mathfrak{e}$ they are identical: $E$ is idempotent. Hence
$\Sigma$ can be completely reduced into two independent sub-spaces, $\Pi$
and a complementary invariant sub-space, in accordance with the formula

$$ S = SE + (S - SE). $$

[Cf. the proof of Theorem (3.3).] Successive application of
this procedure leads to the complete reduction of $\Sigma$ into its
constituents $\Pi_i$.

Having proved Theorem (6.2), we obtain from Theorem
(6.1), under the same assumptions, the further theorem:

Theorem (6.3). Every representation of $\Sigma$ is completely
reducible into irreducible representations. Every irreducible
representation of $\Sigma$ coincides with the representation $A \to A_i$
induced in some $\mathfrak{P}_i$ by the algebra $\Sigma$ itself.

Theorem (6.1) yields the further (rather uninteresting) fact
that not only is every $\Pi_i$ similar to some $\mathfrak{P}_k$ but also
conversely every $\mathfrak{P}_k$ is similar to some $\Pi_i$.

As has already been indicated, all of these results are applicable
to the algebra of symmetric transformations in tensor space
$\mathfrak{R}^f$. But we have shown in § 1 that this algebra can be
replaced by the group $(\mathfrak{c})^f$ induced in tensor space by the
group $\mathfrak{c}$ of linear transformations

$$ x_i' = \sum_k a(ik)\, x_k \qquad [\det\,[a(ik)] \neq 0] \tag{1.3} $$

of $n$-dimensional vector space, i.e. by the representation
$(\mathfrak{c})^f$ of $\mathfrak{c}$. We shall say that a representation of
$\mathfrak{c}$ is of order $f$ if the components of the matrix which
corresponds to the element (1.3) of the group are rational integral
functions of the $a(ik)$ of order $f$. Our theorem then asserts:

Theorem (6.4). Every representation of $\mathfrak{c}$ of order $f$ is
completely reducible into irreducible representations, and every
irreducible representation of order $f$ of $\mathfrak{c}$ is contained in
the representation $(\mathfrak{c})^f$. This theorem is still valid on
restricting the affine group $\mathfrak{c}$ to its unitary sub-group
$\mathfrak{u}$. (Naturally the concept “unitary” implies that we are then
no longer dealing with an arbitrary field, but are operating in the field
of all complex numbers.)

§ 7. Constructive Reduction of an Algebra into Simple Matric Algebras

We again assume that the algebra $\rho$ of order $h$, which may
at the same time be considered as a vector space $\mathfrak{r}$ of $h$
dimensions, is completely reducible into irreducible invariant sub-spaces
$\mathfrak{p}_i$. The generating units $e_i$ of these irreducible
$\mathfrak{p}_i$ are obtained by the corresponding reduction of the
modulus 1; we can then express an arbitrary element $x$ of $\mathfrak{r}$
as the sum of its components in the various $\mathfrak{p}_i$:

$$ 1 = \sum_i e_i, \qquad x = \sum_i x e_i. \tag{7.1} $$

If $\mathfrak{q}$ is a sub-space of $\mathfrak{r}$ we denote by
$\mathfrak{q}a$ the totality of elements of the form $xa$ where $x$ runs
through all elements of $\mathfrak{q}$; $e$, with or without index, is an
idempotent element, usually primitive; $\mathfrak{p} = \mathfrak{r}e$ the
invariant sub-space generated by $e$; $\mathfrak{h}$ the representation of
$\rho$ induced in $\mathfrak{p}$ by the regular representation.

We could consider in addition to the reduction (7.1) of $\mathfrak{r}$
into left-invariant sub-spaces the analogous reduction into
right-invariant sub-spaces by means of the equation

$$ x = \sum_i e_i x. $$

But the most complete separation into mutually independent
components is obtained by carrying out both of these processes
simultaneously:

$$ x = \sum_{i,k} e_i x e_k = \sum_{i,k} x_{ik}. \tag{7.2} $$
The elements of the form $e_i x e_k$ are those of character $(e_i, e_k)$,
or briefly $(ik)$. Let $\mathfrak{p}_{ik}$ be the sub-space consisting of
all elements of this character. The various $\mathfrak{p}_{ik}$ are
independent and the entire $\mathfrak{r}$ is reduced into the sum of the
$\mathfrak{p}_{ik}$; the original left-invariant
$\mathfrak{p}_k = \sum_i \mathfrak{p}_{ik}$. The important properties of
the $\mathfrak{p}_{ik}$ are given by the following:

Auxiliary Theorem (7.3). I. If $\mathfrak{p}$, $\mathfrak{p}'$ are two
inequivalent irreducible sub-spaces with generating units $e$, $e'$, all
elements of character $(e, e')$ are $= 0$.

II. The elements of character $(e, e)$ constitute a field or division
algebra which is simply isomorphic with the system of similar
projections of $\mathfrak{p}$ on itself.

Proof. I. Let $a$ be any element of character $(e, e')$. The
transformation

$$ [a] :\ x \to x' = xa \tag{7.4} $$

carries every element $x$ of $\mathfrak{p}$ over into an element $x'$ of
$\mathfrak{p}'$ and defines a similar projection. Conversely, we know that
any similar projection of $\mathfrak{p}$ on $\mathfrak{p}'$ is defined by
an equation of this form, and that the generating element $a$ of
character $(e, e')$ is uniquely determined by the projection. If
$\mathfrak{p}$ and $\mathfrak{p}'$ are irreducible our “typical reasoning”
leads us to the two usual alternatives: either the projection associates
with every element $x$ of $\mathfrak{p}$ the image $x' = 0$ or it defines
a one-to-one correspondence of $\mathfrak{p}$ on $\mathfrak{p}'$. The
equation $ea = a$ tells us that the first alternative is possible only if
$a = 0$, and the second implies that $\mathfrak{p}$ and $\mathfrak{p}'$
are equivalent.

II. The above remarks are applicable to an element $a$ of
character $(e, e)$ and the similarity projection of $\mathfrak{p}$ on
itself which it generates. If $\mathfrak{p}$ is irreducible every such
projection, except the one defined by $a = 0$, is one-to-one and
consequently has an inverse. But the existence of an inverse is identical
with the possibility of division. The isomorphism asserted in the
theorem is apparent on reversing our usual procedure, and
reading the resultant of two or more correspondences from
left to right, for the resultant of the correspondences

$$ x' = xa, \qquad x'' = x'a' $$

is given by

$$ x'' = x(aa'). $$

We now proceed with the help of this auxiliary theorem as
follows: Arrange the $\mathfrak{p}_i$ into classes of equivalent
sub-spaces with generating units

$$ e_1', \ldots, e_{r'}'\,;\quad e_1'', \ldots, e_{r''}''\,;\quad \ldots $$

and add together the generating units in each of these classes:

$$ e_1' + \cdots + e_{r'}' = \varepsilon', \qquad e_1'' + \cdots + e_{r''}'' = \varepsilon'', \qquad \ldots $$

We then have

$$ 1 = \varepsilon' + \varepsilon'' + \cdots \tag{7.5} $$

$$ \mathfrak{r} = \mathfrak{r}' + \mathfrak{r}'' + \cdots \tag{7.6} $$

where $\mathfrak{r}', \mathfrak{r}'', \ldots$ denote the inequivalent
sub-spaces $\mathfrak{r}\varepsilon', \mathfrak{r}\varepsilon'', \ldots$
into which $\mathfrak{r}$ is reduced.

Part I of the auxiliary theorem above then tells us that,
for example,

$$ \varepsilon' x \varepsilon'' = 0. $$

Hence the product $a'a''$ of two elements belonging to different
sub-spaces $\mathfrak{r}'$, $\mathfrak{r}''$ is always 0, and the reduction

$$ a = a' + a'' + \cdots = a\varepsilon' + a\varepsilon'' + \cdots $$

leads to the multiplication rule

$$ ab = a'b' + a''b'' + \cdots. $$

From this it follows that $\mathfrak{r}'$ is both right- and
left-invariant and a fortiori constitutes an algebra $\rho'$ (“invariant
sub-algebra”); $\varepsilon'$ is the modulus of $\rho'$. The given algebra
is then the direct sum of the simple algebras $\rho', \rho'', \ldots$,
where the precise meaning of direct sum is defined by the following:

Let $\rho', \rho'', \ldots$ be algebras (defined over the same field), and
consider as the elements of a new algebra $\rho$, the direct sum of
$\rho', \rho'', \ldots$, all sets

$$ a = (a', a'', \ldots) $$

consisting of an arbitrary element $a'$ of $\rho'$, an arbitrary $a''$
of $\rho''$, .... The fundamental operations in $\rho$ are defined by

$$ \begin{aligned}
(a', a'', \ldots) + (b', b'', \ldots) &= (a' + b',\ a'' + b'',\ \ldots), \\
\lambda\,(a', a'', \ldots) &= (\lambda a',\ \lambda a'',\ \ldots), \\
(a', a'', \ldots)\,(b', b'', \ldots) &= (a'b',\ a''b'',\ \ldots),
\end{aligned} $$

where $\lambda$ is any number.

Note that the central of the algebra p obtained by direct 
summation is the direct sum of the centrals of the individual 
algebras p', p", • • •. 
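The direct-sum rules just stated can be tried out on a small case. The following is only an illustrative sketch, under the assumption that the two summands are full matrix algebras of orders 2 and 1 over the integers; the helper names `matmul` and `block_diag` are introduced here for the illustration and do not occur in the text. The pair $(a', a'')$ is realized as a block-diagonal matrix, so that the componentwise product rule becomes ordinary matrix multiplication.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][j] * b[j][k] for j in range(n)) for k in range(n)]
            for i in range(n)]

def block_diag(a1, a2):
    """Realize the pair (a1, a2) of the direct sum as one block-diagonal matrix."""
    n1, n2 = len(a1), len(a2)
    out = [[0] * (n1 + n2) for _ in range(n1 + n2)]
    for i in range(n1):
        for j in range(n1):
            out[i][j] = a1[i][j]
    for i in range(n2):
        for j in range(n2):
            out[n1 + i][n1 + j] = a2[i][j]
    return out

# Two elements a = (a', a'') and b = (b', b'') of the direct sum.
a1, a2 = [[1, 2], [3, 4]], [[5]]
b1, b2 = [[0, 1], [1, 0]], [[7]]

# Product computed in the direct sum versus product computed componentwise.
lhs = matmul(block_diag(a1, a2), block_diag(b1, b2))
rhs = block_diag(matmul(a1, b1), matmul(a2, b2))

# Elements lying in different summands annihilate one another.
zero2, zero1 = [[0, 0], [0, 0]], [[0]]
cross = matmul(block_diag(a1, zero1), block_diag(zero2, b2))
```

Here `lhs` and `rhs` agree, and `cross` is the zero matrix, which is the statement that the product $a'b''$ of elements from different summands vanishes.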

We investigate in detail one of these simple sub-algebras,
say $\rho'$, which we now denote simply by $\rho$; its modulus
$\varepsilon'$ may now be denoted by 1. On omitting the primes, the
decomposition of 1 into equivalent primitive idempotent elements
$e_i$ is expressed by

$$ 1 = e_1 + e_2 + \cdots + e_r. $$

Every element $a$ of $\rho$ is reduced in accordance with the formula
(double Peirce reduction)

$$ a = \sum_{i,k=1}^{r} a_{ik} = \sum_{i,k} (e_i a e_k) $$

into components of characters $(ik)$. The component $c_{ik}$ of the
product $c = ab$ is easily seen to be expressed in terms of the
components $a_{ik}$, $b_{ik}$ of $a$ and $b$ by the equation

$$ c_{ik} = \sum_{j=1}^{r} a_{ij} b_{jk}. $$

We have thus already obtained the connection between our
considerations and the matrix calculus.
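The double Peirce reduction and the product rule for its components can be checked on the simplest example. The sketch below assumes, purely for illustration, that the simple algebra is the algebra of 2-rowed integer matrices and that the primitive idempotents are the diagonal matrix units; the function names `unit` and `peirce` are introduced here and are not taken from the text.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][j] * b[j][k] for j in range(n)) for k in range(n)]
            for i in range(n)]

def matadd(a, b):
    """Entrywise sum of two matrices of equal shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def unit(i, n):
    """Idempotent e_i: the matrix with a single 1 in place (i, i)."""
    return [[1 if (p == i and q == i) else 0 for q in range(n)] for p in range(n)]

def peirce(a, i, k):
    """Component e_i a e_k of character (ik)."""
    n = len(a)
    return matmul(matmul(unit(i, n), a), unit(k, n))

n = 2
a = [[1, 2], [3, 4]]
b = [[5, 6], [7, 8]]
c = matmul(a, b)
zero = [[0] * n for _ in range(n)]

# a is the sum of its n*n Peirce components.
total = zero
for i in range(n):
    for k in range(n):
        total = matadd(total, peirce(a, i, k))

# c_ik = sum over j of a_ij b_jk, read as an identity between algebra elements.
def product_component(i, k):
    s = zero
    for j in range(n):
        s = matadd(s, matmul(peirce(a, i, j), peirce(b, j, k)))
    return s

rule_holds = all(peirce(c, i, k) == product_component(i, k)
                 for i in range(n) for k in range(n))
```

In this special case each component $a_{ik}$ is simply the matrix that keeps the entry in place $(i, k)$ and is zero elsewhere, so the rule reduces to the familiar row-by-column multiplication.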

The invariant sub-spaces $\mathfrak{p}_1, \mathfrak{p}_2, \ldots,
\mathfrak{p}_r$ generated by the $e_1, e_2, \ldots, e_r$ are all
equivalent. Let $\mathfrak{p}$ be any one of this class, e.g.
$\mathfrak{p} = \mathfrak{p}_1$, and let $\Gamma_i$ be any fixed
one-to-one similarity correspondence of $\mathfrak{p}_i$ on
$\mathfrak{p}$. In accordance with (7.4) any element

$$ a = a_{ik} = e_i a e_k $$

of character $(e_i, e_k)$ generates a similarity projection $[a]$ of
$\mathfrak{p}_i$ on $\mathfrak{p}_k$; this projection can be written in
the form

$$ [a] = \Gamma_i\, \alpha\, \Gamma_k^{-1} \tag{7.7} $$

where $\alpha$ is a similarity projection of $\mathfrak{p}$ on itself. But
by Part II of the auxiliary theorem proved above the similarity
projections of $\mathfrak{p}$ on itself constitute a field (division
algebra) $\Phi$ which is simply isomorphic with the set of elements of
character $(e, e)$. If $\Phi$ is of order $v$ each of the $r$
left-invariant sub-spaces

$$ \mathfrak{p}_k = \sum_{i=1}^{r} \mathfrak{p}_{ik} $$

is of dimensionality $g = r \cdot v$. The number of times $r$ an
irreducible representation occurs in the regular representation is
accordingly a factor of the dimensionality $g$ of the representation.

Any element $a$ can be reduced into its components $a_{ik}$,
which may be any elements of the independent sub-spaces
$\mathfrak{p}_{ik}$. In accordance with (7.7)

$$ [a_{ik}] = \Gamma_i\, \alpha_{ik}\, \Gamma_k^{-1} \tag{7.8} $$

and $\alpha_{ik}$ may be replaced by the corresponding element of
the field $\Phi$. Since conversely any such element $\alpha_{ik}$ is by
(7.8) associated with a similarity projection $[a_{ik}]$ of
$\mathfrak{p}_i$ on $\mathfrak{p}_k$ and therefore with a definite element
$a_{ik}$ of character $(ik)$, we obtain a one-to-one reciprocal
correspondence between the totality of all elements $a$ of the simple
algebra $\rho$ and the totality of matrices

$$ \begin{pmatrix}
\alpha_{11} & \alpha_{12} & \cdots & \alpha_{1r} \\
\alpha_{21} & \alpha_{22} & \cdots & \alpha_{2r} \\
\vdots & \vdots & & \vdots \\
\alpha_{r1} & \alpha_{r2} & \cdots & \alpha_{rr}
\end{pmatrix} \tag{7.9} $$

of order $r$ whose components $\alpha_{ik}$ are elements of the field
$\Phi$. The correspondence is such that the three fundamental
operations of the one (addition of elements, multiplication of
an element by a number and multiplication of two elements)
correspond to the same operations of the other. Note that in
particular

$$ [a_{ij} b_{jk}] = \Gamma_i \alpha_{ij} \Gamma_j^{-1} \cdot \Gamma_j \beta_{jk} \Gamma_k^{-1} = \Gamma_i\, \alpha_{ij}\beta_{jk}\, \Gamma_k^{-1}. $$
We have thus proved : 

Wedderburn’s Theorem. Any of the simple algebras, whose
direct sum constitutes the given algebra $\rho$, is simply isomorphic
with a simple matric algebra in a certain field (division algebra)
$\Phi$ defined over the field of the original algebra.

(Remark. The invariant sub-space $\mathfrak{p}_k$ consists of all elements
$a$ such that the matrix $\|\alpha_{ik}\|$ has as its only non-vanishing
column the $k$-th. The element $e_k$ is then described by that diagonal
matrix all of whose components vanish except the one occupying the
$k$-th place, which is 1.)

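It is readily seen that the central of the simple algebra $\rho$
consists of those elements whose matrix (7.9) is of the form

$$ \begin{pmatrix}
\alpha & 0 & \cdots & 0 \\
0 & \alpha & \cdots & 0 \\
\vdots & \vdots & & \vdots \\
0 & 0 & \cdots & \alpha
\end{pmatrix} $$

where $\alpha$ belongs to the central of the field $\Phi$.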

Our construction was divided into two steps. First $\mathfrak{r}$ was
completely reduced into the sub-spaces $\mathfrak{r}', \mathfrak{r}'',
\ldots$ which are both right- and left-invariant, and then these were
further reduced into the left-invariant sub-spaces $\mathfrak{p}_i$. We
must now return to the consideration of the first step. On multiplying
$x\varepsilon'$ on the left by the decomposition (7.5) of the modulus we
find

$$ x\varepsilon' = \varepsilon' x \varepsilon', $$

and on multiplying $\varepsilon' x$ on the right by the same factor

$$ \varepsilon' x = \varepsilon' x \varepsilon'. $$

Hence

$$ x\varepsilon' = \varepsilon' x\,; $$

the $\varepsilon', \varepsilon'', \ldots$ commute with all elements and
belong to the central of the algebra. The sub-spaces
$\mathfrak{r}' = \rho', \mathfrak{r}'', \ldots$ are both right- and
left-invariant in the sense that neither the transformation
$x' = xa$ nor $x' = ax$ leads out of them, and they are furthermore
irreducible in this respect; indeed, it is for this reason we call
them “simple.” In order to show this we proceed as follows:

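(7.10). If $\mathfrak{r}_0$ is a sub-space which is both right- and
left-invariant then either $e_i$ is contained in $\mathfrak{r}_0$ or
$\mathfrak{r}_0 e_i = 0$. For $\mathfrak{r}_0 e_i$ is an invariant
sub-space of the irreducible $\mathfrak{p}_i$ and is therefore either 0 or
$\mathfrak{p}_i$ itself. In the second case we have

$$ \mathfrak{p}_i = \mathfrak{r}_0 e_i \subseteq \mathfrak{r}_0 $$

since $\mathfrak{r}_0$ is right-invariant; hence $e_i$ is contained in
$\mathfrak{r}_0$.

(7.11). If $e_i$ is in $\mathfrak{r}_0$ the same is true of any $e$ which
is equivalent to $e_i$. For the similarity projection $x \to x' = xb$ of
$\mathfrak{p}_i$ on $\mathfrak{p}$ associates $e$ with some element $a_i$
of $\mathfrak{p}_i$ by means of the equation $e = a_i b$, and since $a_i$
is in $\mathfrak{r}_0$, $e$ is also.

(7.12). If $\mathfrak{r}_0$ is not empty and is contained in
$\mathfrak{r}'$ then, since
$\mathfrak{r}_0 = \sum_i \mathfrak{r}_0 e_i'$, not all the
$\mathfrak{r}_0 e_i'$ can be empty, i.e. one of the $e_i'$ must occur in
$\mathfrak{r}_0$. But they must then all occur in $\mathfrak{r}_0$, hence
also $\varepsilon' = \sum_i e_i'$, and consequently
$\mathfrak{r}_0 = \mathfrak{r}'$.

(7.13). Again let $\mathfrak{r}_0$ be a right- and left-invariant
sub-space. Then either $\mathfrak{r}_0\varepsilon' = \mathfrak{r}'$ or it
is empty; in the former case $\varepsilon'$ is in $\mathfrak{r}_0$. It
follows from

$$ \mathfrak{r}_0 = \mathfrak{r}_0\varepsilon' + \mathfrak{r}_0\varepsilon'' + \cdots $$

that $\mathfrak{r}_0$ is necessarily the sum of certain of the spaces
$\mathfrak{r}', \mathfrak{r}'', \ldots$; when in particular
$\mathfrak{r}_0$ is irreducible in the sense of right- and
left-invariance it must coincide with one of the
$\mathfrak{r}', \mathfrak{r}'', \ldots$. Hence the reduction (7.6) is
unique. This further shows that every right- and left-invariant sub-space
$\mathfrak{r}_0$ possesses a generating unit $\varepsilon_0$ which belongs
to the central of the algebra, and that $\mathfrak{r}$ can be completely
reduced into $\mathfrak{r}_0$ and a supplementary right- and
left-invariant sub-space.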

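(7.14). If $\mathfrak{p}$ is an irreducible (left-) invariant sub-space
with the generating unit $e$, then $\mathfrak{p}\varepsilon'$ is
invariant, and since $\mathfrak{p}\varepsilon' = \varepsilon'\mathfrak{p}$
is contained in $\mathfrak{p}$ it is either 0 or $\mathfrak{p}$ itself.
Since

$$ \mathfrak{p} = \mathfrak{p}\varepsilon' + \mathfrak{p}\varepsilon'' + \cdots $$

the equation $\mathfrak{p}\varepsilon = \mathfrak{p}$ must hold for some
one $\varepsilon$ of the $\varepsilon', \varepsilon'', \ldots$, while for
all others $\mathfrak{p}\varepsilon = 0$. We then say that $\varepsilon$
belongs to $e$ or $\mathfrak{p}$, and that conversely $e$ or
$\mathfrak{p}$ belongs to $\varepsilon$. $\mathfrak{p}$ is a sub-space of
the right- and left-invariant $\mathfrak{r}\varepsilon$.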

An algebra $\rho = \mathfrak{r}$, concerning which we only assume that it
is completely reducible into irreducible invariant sub-spaces
$\mathfrak{p}_i$, is necessarily obtainable by successive application of
the following processes:

(A) Construction of a field ; 

(B) Transition to matrices : we take as elements the matrices 
of a fixed order r whose components are arbitrary elements of 
the field ; 

(C) Direct summation. 

The processes (B) and (C) are formally completely determined 
and are therefore of an elementary character. Hence the 
construction of algebras is reduced to the construction of fields, 
i.e. of special algebras in which division is possible (“ division 
algebras ”). 

The converse is naturally also true : any algebra constructed 
by the three steps (A), (B) and (C) is completely reducible, for: 

(A) If the algebra $\mathfrak{r}$ is itself a field, $\mathfrak{r}$ is
itself an irreducible sub-space of $\mathfrak{r}$. For if $a$ is any
non-null element of the field then $xa$ runs through the entire field
with $x$; this is merely the content of the division axiom.

(B) The matrices (7.9) in which all components of every
column except the $k$-th vanish constitute the irreducible sub-space
$\mathfrak{p}_k$, and the space $\mathfrak{r}$ of all matrices is the sum
of these $\mathfrak{p}_k$. $\mathfrak{p}_k$ is irreducible; to show this
we must prove that if $a$ is any non-vanishing element in
$\mathfrak{p}_k$ then any element of $\mathfrak{p}_k$ can be expressed in
the form $xa$. $a$ as well as $a' = xa$ has as its only non-vanishing
column the $k$-th; dropping the last index $k$, we denote these two
columns by

$$ (\alpha_1, \alpha_2, \ldots, \alpha_r), \qquad (\alpha_1', \alpha_2', \ldots, \alpha_r'), $$

respectively. The equation $a' = xa$ is then

$$ \alpha_i' = \sum_{j=1}^{r} \xi_{ij}\, \alpha_j\,; $$

we are therefore concerned with proving the theorem that any
non-vanishing “vector” $(\alpha_1, \alpha_2, \ldots, \alpha_r)$ can be
transformed into any given “vector”
$(\alpha_1', \alpha_2', \ldots, \alpha_r')$ by an appropriate linear
correspondence. Since not all the $\alpha_j$ vanish take one of them,
say $\alpha_2$, which does not vanish and let all $\xi_{ik}$ for which
$k \neq 2$ be 0; $\xi_{i2}$ is then to be determined by the equation

$$ \alpha_i' = \xi_{i2}\, \alpha_2\,; $$

that this is possible is guaranteed by the division axiom.
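The construction used in step (B) can be written out explicitly. The sketch below makes the simplifying assumption that the division algebra is the field of rational numbers, so that the division axiom is ordinary division; the names `transforming_matrix` and `apply` are hypothetical helpers introduced here. In a non-commutative division algebra the quotient would have to be taken on the right, $\xi_{ij} = \alpha_i' \alpha_j^{-1}$, which this commutative example does not exercise.

```python
from fractions import Fraction

def transforming_matrix(alpha, alpha_new):
    """Return xi with sum_j xi[i][j] * alpha[j] == alpha_new[i].

    Only one column of xi is non-zero: the column j of the first
    non-vanishing component alpha[j], as in the argument of step (B).
    """
    r = len(alpha)
    j = next(idx for idx, v in enumerate(alpha) if v != 0)
    xi = [[Fraction(0)] * r for _ in range(r)]
    for i in range(r):
        xi[i][j] = Fraction(alpha_new[i]) / Fraction(alpha[j])
    return xi

def apply(xi, alpha):
    """Left multiplication of the column alpha by the matrix xi."""
    r = len(alpha)
    return [sum(xi[i][j] * alpha[j] for j in range(r)) for i in range(r)]

alpha = [0, 3, 5]          # non-vanishing column of a
alpha_new = [2, -1, 7]     # arbitrary target column of a' = x a
xi = transforming_matrix(alpha, alpha_new)
image = apply(xi, alpha)
```

The list `image` reproduces `alpha_new`, so every column can be reached from a single non-vanishing one.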
(C) The assertion is self-evident for this step.

In general only the first step, (A), does not lend itself to an
exhaustive formal treatment. However, if the field over which
the field (“division algebra”) referred to in (A) is defined is
algebraically closed this step becomes extremely simple:

The only division algebra of finite order over an algebraically
closed field is this field itself.

Proof. Consider an algebra of order $v$ defined over an
algebraically closed field. If $a$ is an element of the algebra
there must exist a linear dependence between the $v + 1$ powers
$a^v, \ldots, a, 1$, i.e. a linear relation whose coefficients are
numbers of the field. Hence $a$ satisfies an algebraic equation
of degree $m \leq v$:

$$ f(\lambda) = \lambda^m + \gamma_1 \lambda^{m-1} + \cdots + \gamma_m, \qquad
f(a) = a^m + \gamma_1 a^{m-1} + \cdots + \gamma_m 1 = 0. $$

Since the field is algebraically closed $f(\lambda)$ can be expressed as
the product of linear factors:

$$ f(\lambda) = (\lambda - \alpha_1)(\lambda - \alpha_2) \cdots (\lambda - \alpha_m). $$

Correspondingly

$$ (a - \alpha_1 1)(a - \alpha_2 1) \cdots (a - \alpha_m 1) = 0. \tag{7.15} $$

We now introduce the assumption that the algebra of order $v$ is
a division algebra; then the product of two or more elements
can vanish only if one of the factors is 0. Hence we may conclude
from (7.15) that $a = \alpha_i 1$ for some $i$; the algebra then
consists of the products of the modulus 1 with any number of
the fundamental field, and therefore the algebra itself is simply
isomorphic with this field.
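The role of algebraic closure in this proof can be seen from a standard example which is not taken from the text: the quaternions, with units $1, i, j, k$. The symbol $\iota$ below denotes the imaginary unit of the coefficient field and is introduced here only to keep it distinct from the quaternion $i$.

```latex
% Over the real numbers the quaternion unit i satisfies
%   f(\lambda) = \lambda^2 + 1 ,
% which has no real linear factors, so (7.15) cannot be formed and the
% quaternions remain a division algebra of order 4 over the reals.
%
% Over the complex numbers f(\lambda) = (\lambda - \iota)(\lambda + \iota), hence
\[
  (i - \iota\, 1)(i + \iota\, 1) \;=\; i^2 - \iota^2\, 1 \;=\; -1 + 1 \;=\; 0 ,
\]
% although neither factor is 0, since 1 and i are linearly independent.
% The quaternions with complex coefficients therefore contain divisors of
% zero and are not a division algebra, in agreement with the theorem.
```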

If we are dealing in the field of all complex numbers the 
auxiliary theorem (7.3) can be replaced, in accordance with 
the above, by the more definite : 

(7.3'). All elements of the form $e x e'$ are zero if the primitive
idempotent elements $e$, $e'$ are inequivalent. If they are equivalent
all such elements are multiples of one of them (which is different
from 0).

Further : The number of times an irreducible representation 
appears in the regular representation is not merely a factor of the 
dimensionality of the representation ; it is actually equal to it. 
Our analysis has thus revealed the true source of this remarkable 
fact. 
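A small numerical illustration of the distinction between “factor of” and “equal to” may be useful. The two examples are standard ones chosen here for illustration; they are not taken from the text.

```latex
% Algebraically closed field (complex numbers), symmetric group on 3 things,
% h = 6.  The irreducible representations have dimensionalities 1, 1, 2, and
% each occurs in the regular representation as often as its dimensionality:
\[
  h \;=\; \sum g^2 \;=\; 1^2 + 1^2 + 2^2 \;=\; 6 .
\]
% Field of rational numbers, cyclic group of order 3, h = 3.  The group
% algebra is the direct sum of the rational field and the field obtained by
% adjoining a root \omega of \lambda^2 + \lambda + 1.  For the second
% summand the division algebra has order v = 2 and the matrices have order
% r = 1, so that
\[
  g \;=\; r \cdot v \;=\; 1 \cdot 2 \;=\; 2 , \qquad h \;=\; 1 + 2 \;=\; 3 :
\]
% the representation of dimensionality 2 occurs only once, r being a proper
% factor of g.
```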

Under these circumstances the given (“semi-simple”) algebra
is the direct sum of simple matric algebras over the original field.
We obtain a complete set of basal units $e_{ik}', e_{\iota\kappa}'', \ldots$:

$$ a = \sum_{i,k} \alpha_{ik}'\, e_{ik}' + \sum_{\iota,\kappa} \alpha_{\iota\kappa}''\, e_{\iota\kappa}'' + \cdots \tag{7.16} $$

for the algebra; these basal units satisfy the multiplication
law of “matrix units,” i.e. products of the type

$$ e_{ij}'\, e_{jk}' = e_{ik}' \tag{7.17} $$

have the value indicated and all others vanish. The correspondences

$$ a \to \|\alpha_{ik}'\|, \qquad a \to \|\alpha_{\iota\kappa}''\|, \qquad \ldots $$

are the inequivalent irreducible representations
$\mathfrak{h}', \mathfrak{h}'', \ldots$.

The basal units $e_{ii}', \ldots$ are the generating units
$e_i', \ldots$ of the irreducible sub-spaces $\mathfrak{p}_i$ with which
we began our construction. $e_{ik}'$ is the element of character $(ik)$
generated by the correspondence $\Gamma_i \Gamma_k^{-1}$ of
$\mathfrak{p}_i'$ on $\mathfrak{p}_k'$, i.e. that element which
this correspondence associates with $e_i'$.

After having obtained the irreducible representations in 
this constructive way we derive their orthogonality properties 
again from our present standpoint. For the moment let the 
trace of a denote the trace of the correspondence 

$$ x \to y = ax \tag{7.18} $$

of $\mathfrak{r}$ on itself which is associated with $a$ in the regular
representation. In terms of the co-ordinate system defined by the
basal units above this correspondence becomes

$$ y_{ik}' = \sum_{j=1}^{g'} \alpha_{ij}'\, x_{jk}', \qquad \ldots $$

Each of the $g'$ columns of variables

$$ x_{1k}',\ x_{2k}',\ \ldots,\ x_{g'k}' \qquad (k = 1, 2, \ldots, g') $$

undergoes the transformation with matrix $\|\alpha_{ij}'\|$; the trace of
$a$ is accordingly

$$ g' \cdot \sum_{i=1}^{g'} \alpha_{ii}' + \cdots. $$

By (7.16) this is equivalent to the equations

$$ \operatorname{tr}(e_{ik}') = \begin{cases} g' & (i = k), \\ 0 & (i \neq k), \end{cases} \qquad \ldots $$

for the basal units. Hence by (7.17)

$$ \operatorname{tr}(e_{ik}'\, e_{ki}') = g', \qquad \ldots \tag{7.19} $$

and all other types of products of basal matric units have a
vanishing trace.

If the algebra is the algebra of a group of order $h$ the
correspondence (7.18) is expressed in the original co-ordinate system,
consisting of the basal units associated with the elements $s$ of
the group, by the equation

$$ y(s) = \sum_t a(st^{-1})\, x(t). $$

From this it follows that the trace, as defined above, of $a$ is
equal to $h \cdot a(1)$; but in the case of a group algebra we have
previously called $a(1)$ itself, without the factor $h$, the trace of $a$.
On returning to this original definition of the trace we need
merely to replace the right-hand side $g'$ of the orthogonality
relations (7.19) by $g'/h$. Equation (7.16) may now be solved
explicitly for the coefficients:

$$ \alpha_{ik}' = \frac{h}{g'} \operatorname{tr}(a\, e_{ki}') = \frac{h}{g'} \sum_s a(s)\, e_{ki}'(s^{-1}). \tag{7.20} $$

The connection with the development in Chapter III, § 13, is
obtained by noting that the

$$ u_{ik}'(s) = \frac{h}{g'}\, e_{ki}'(s^{-1}) \tag{7.21} $$

are the components of the matrix $U'(s)$ associated with the
element $s$ of the group in the irreducible representation
$\mathfrak{h}'$. The character of $\mathfrak{h}'$ is therefore

$$ \chi'(s) = \frac{h}{g'} \cdot \varepsilon'(s^{-1}) \tag{7.22} $$

and (7.19) yields the orthogonality relations for the representations.
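The statement that the trace of $x \to ax$ in the regular representation of a group algebra equals $h \cdot a(1)$ can be verified directly on a small group. The following sketch assumes the symmetric group on three things, with permutations stored as tuples and group-algebra elements as dictionaries; all function names are introduced here for the illustration.

```python
from itertools import permutations

G = list(permutations(range(3)))   # the h = 6 elements of the group
IDENTITY = (0, 1, 2)

def mul(s, t):
    """Group product st: apply t first, then s."""
    return tuple(s[t[i]] for i in range(3))

def inv(s):
    """Inverse permutation."""
    r = [0] * 3
    for i, si in enumerate(s):
        r[si] = i
    return tuple(r)

def product(a, b):
    """Product in the group algebra: (ab)(u) = sum of a(s) b(t) over st = u."""
    c = {s: 0 for s in G}
    for s in G:
        for t in G:
            c[mul(s, t)] += a[s] * b[t]
    return c

def regular_matrix(a):
    """Matrix of x -> ax, i.e. y(s) = sum_t a(s t^-1) x(t)."""
    return [[a[mul(s, inv(t))] for t in G] for s in G]

def matmul(m, n):
    k = len(m)
    return [[sum(m[i][j] * n[j][l] for j in range(k)) for l in range(k)]
            for i in range(k)]

def trace(m):
    return sum(m[i][i] for i in range(len(m)))

# Two arbitrary elements of the group algebra with integer components.
a = {s: i + 1 for i, s in enumerate(G)}
b = {s: (i * i) % 5 - 2 for i, s in enumerate(G)}

trace_of_a = trace(regular_matrix(a))
is_representation = regular_matrix(product(a, b)) == matmul(
    regular_matrix(a), regular_matrix(b))
```

Each diagonal entry of `regular_matrix(a)` is $a(ss^{-1}) = a(1)$, which is why the trace comes out as $h \cdot a(1)$; the second check confirms that the matrices multiply as the elements do.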

We have thus arrived at a constructive formulation of the 
theory, in which the fundamental concepts involved in and the 
range of validity of each step are clearly apparent. It supplies 
us with a constructive method for obtaining a complete set of 
irreducible representations, as well as establishing the ortho- 
gonality relations. 

Additional remark. In dealing with the continuum of all complex
numbers and a group algebra defined over this field we can, in
accordance with the remark at the end of § 3, completely reduce the
modulus 1 into real primitive $e_i$ and the space $\mathfrak{r}$ into the
corresponding unitary-orthogonal irreducible $\mathfrak{p}_i$. Further,
the projections $\Gamma_i$ can be normalized in such a way that $e_{ki}$
is conjugate to $e_{ik}$. To show this we note that the conjugate
$\tilde{e}_{ik}$ of $e_{ik}$ is under these conditions an element of
character $(ki)$ and must therefore be the product of $e_{ki}$ by a
number:


The rules 


"" yxk * ^iti- 







(7.23) 


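yield the conditions

$$ \gamma_{ik}\gamma_{kl} = \gamma_{il}, \qquad \gamma_{ii} = 1 $$

on the coefficients. Further, $\gamma_{ik}$ is real and positive, for
from (7.23) and (7.19) we find

$$ \sum_s |e_{ik}(s)|^2 = \operatorname{tr}(e_{ik}\tilde{e}_{ik}) = \gamma_{ik} \cdot \frac{g}{h}. $$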

8 

We then find that the y-j^ can be brought into the form y.^ = where 
the p^ are positive real numbers (take, for example, pf == y^-). On re- 
placing the original correspondences /V by p-P. we find that the new 
is actually conjugate to the new Our representations l/, % ^ ^ are 

accordingly thrown into unitary form. 


B. Extension of the Theory and Physical Applications

§ 8. The Characters of the Symmetric Group and 
Equivalence Degeneracy in Quantum Mechanics 

The notation employed in this section is as follows: $\pi = \pi_f$
is the symmetric permutation group of $f$ things,
$\mathfrak{r} = \rho = (\pi)$ the corresponding algebra, $e$ a (primitive)
idempotent element of $\rho$, $\mathfrak{p} = \mathfrak{r}e$ the
(irreducible) invariant sub-space of $\mathfrak{r}$ generated by $e$,
$\mathfrak{h}$ the representation induced in $\mathfrak{p}$ by the regular
representation, $g$ the dimensionality of $\mathfrak{p}$ and $\chi$ the
character of $\mathfrak{h}$, $\varepsilon$ that element of the set
$\varepsilon', \varepsilon'', \ldots$ (7.14) to which the irreducible
$\mathfrak{p}$ belongs; $\mathfrak{P}$ the corresponding symmetry class of
tensors of order $f$, consisting of all tensors of the form $eF$,
$\mathfrak{H}$ the representation of the algebra $\Sigma$ of symmetric
transformations (and therefore of the linear group $\mathfrak{c}$) which
is induced in $\mathfrak{P}$ by $\Sigma$ itself. When further
differentiation is necessary, we also denote this $\mathfrak{H}$ by
$\mathfrak{H}(\chi)$ or $\mathfrak{H}_f(\chi)$. Our considerations are
valid for an arbitrary finite group $\pi$; $h$ denotes the order of $\pi$
($= f!$ for $\pi_f$).

Determination of the Group Characters.

We begin by calculating the character of the representation
$\mathfrak{h}$. To this end we construct the trace of the linear
correspondence

$$ x \to y = ax \tag{8.1} $$

of $\mathfrak{p}$ on itself; the considerations of the previous section
show it to be

$$ \sum_s a(s)\, \chi(s). $$

Now consider instead of (8.1) the projection

$$ x \to y = axe \tag{8.2} $$

of the total space $\mathfrak{r}$ on $\mathfrak{p}$; it coincides with
(8.1) within $\mathfrak{p}$ and sends any element $x$ of $\mathfrak{r}$
into an element $y$ of $\mathfrak{p}$. On choosing the co-ordinate system
in $\mathfrak{r}$ in such a way that the first $g$ fundamental vectors
span the sub-space $\mathfrak{p}$, the last $h - g$ rows of the matrix of
(8.2) consist only of zeros; hence the trace of the projection (8.2) of
the total group space is equal to the trace of the correspondence (8.1)
in $\mathfrak{p}$. In terms of components equation (8.2) is

$$ y(s) = \sum a(t)\, x(s')\, e(t') \qquad (ts't' = s) $$

and the trace is therefore

$$ \sum_s \sum a(t)\, e(t') $$

where the inner sum is extended over the pairs $t$, $t'$ of elements
of the group which satisfy the equation $tst' = s$, or explicitly,
the trace is

$$ \sum_t \sum_s a(t)\, e(s^{-1} t^{-1} s). $$

Hence the character $\chi$ of $\mathfrak{h}$ is given by

$$ \chi(t) = \sum_s e(s^{-1} t^{-1} s) $$

or

$$ \chi(s) = \sum_r e(r s^{-1} r^{-1}). \tag{8.3} $$

In particular, the dimensionality $g$ of the representation (and
the space $\mathfrak{p}$) is

$$ \chi(1) = h \cdot e(1). $$
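Formula (8.3) can be evaluated mechanically once an idempotent is given. The sketch below is an illustration under assumptions not made in the text: the group is the symmetric group on three things, and $e$ is taken to be one third of the product of the symmetrizer of the first two things with the antisymmetrizer of the first and third (a Young-type idempotent). The helper names are introduced here.

```python
from fractions import Fraction
from itertools import permutations

G = list(permutations(range(3)))          # the h = 6 permutations of 3 things
IDENTITY = (0, 1, 2)
T12, T13, CYCLE = (1, 0, 2), (2, 1, 0), (1, 2, 0)

def mul(s, t):
    """Group product st: apply t first, then s."""
    return tuple(s[t[i]] for i in range(3))

def inv(s):
    """Inverse permutation."""
    r = [0] * 3
    for i, si in enumerate(s):
        r[si] = i
    return tuple(r)

def element(coeffs):
    """Element of the group algebra as a dict: group element -> component."""
    x = {s: Fraction(0) for s in G}
    for s, c in coeffs.items():
        x[s] += Fraction(c)
    return x

def product(a, b):
    """Product in the group algebra."""
    c = {s: Fraction(0) for s in G}
    for s in G:
        for t in G:
            c[mul(s, t)] += a[s] * b[t]
    return c

def character(e, s):
    """chi(s) = sum over r of e(r s^-1 r^-1), as in (8.3)."""
    s_inv = inv(s)
    return sum(e[mul(mul(r, s_inv), inv(r))] for r in G)

# Assumed idempotent: e = (1/3) (1 + (12)) (1 - (13)).
P = element({IDENTITY: 1, T12: 1})
Q = element({IDENTITY: 1, T13: -1})
e = {s: c / 3 for s, c in product(P, Q).items()}

chi = {name: character(e, s)
       for name, s in (("identity", IDENTITY),
                       ("transposition", T12),
                       ("3-cycle", CYCLE))}
dimension = len(G) * e[IDENTITY]          # g = h * e(1)
```

For this choice of $e$ the values come out as 2, 0 and $-1$ on the identity, the transpositions and the 3-cycles, and $h \cdot e(1) = 2$, i.e. the two-dimensional representation of the group.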

Resonance or Equivalence Degeneracy.

The significance of our results for quantum mechanics, as
first recognized by Wigner, is the following. The complete
reduction of the tensor space $\mathfrak{R}^f$ into invariant sub-spaces
$\mathfrak{P}_i$ implies a separation of the terms of the physical system
$I^f$, consisting of $f$ equivalent individuals $I$ (electrons), into sets
of terms which no dynamical influence whatever can cause to
enter into combination with each other. We have further seen
that the reduction of $\mathfrak{R}^f$ into the $\mathfrak{P}_i$ parallels
the complete reduction of the total group space $\mathfrak{r}$ of the
symmetric permutation group $\pi$ into invariant sub-spaces
$\mathfrak{p}_i$. Hence there is a system of terms associated with every
irreducible representation $\mathfrak{h}$ of $\pi$ (which we denote simply
as the term system $\chi$, using the character $\chi$ of $\mathfrak{h}$ as
a name for the system), and the multiplicity of this term system is the
number $m(\chi)$ of times that $\mathfrak{h}$ occurs in the regular
representation. This suffers a slight modification in case $n < f$, for we
must then ignore all $\mathfrak{p}_i$ which are not contained in
$\mathfrak{r}_0$. But since $\mathfrak{r}_0$ is both right- and
left-invariant, all sub-spaces which are equivalent to an irreducible
invariant $\mathfrak{p}$ lying in $\mathfrak{r}_0$ are also in
$\mathfrak{r}_0$. Hence the multiplicity of the term system $\chi$ is
$m(\chi)$ or 0 according as that $\varepsilon$ with which the character
$\chi$ is associated by (7.22) is in $\mathfrak{r}_0$ or not. From the
physical standpoint, the only additional fact of interest obtained from
the more extended theory built up on the assumption that the
number field in which we are operating is algebraically closed
is that then the multiplicity $m(\chi)$ is equal to the dimensionality
$g$ of the representation $\mathfrak{h}$. Furthermore, it is impossible to
resolve this multiplicity by any physical means whatever, for
corresponding terms in these various term systems remain in
coincidence under all dynamical influences.

We consider the resolution of terms in the case in which the
interaction between the $f$ individuals is expressed by a small
perturbation energy $\lambda W$, neglecting higher powers of the small
parameter $\lambda$. Assume for the moment that the energy levels
$E_1, E_2, \cdots$ of a single individual $I$ are non-degenerate. On
neglecting the perturbation, $I^f$ possesses energy terms of the type

$$E = E_1 + E_2 + \cdots + E_f; \tag{8.4}$$

we first concern ourselves with such a term. Its multiplicity
is $f!$ and the corresponding co-ordinates in tensor space are the
coefficients $F(i_1, i_2, \cdots, i_f)$ whose indices are any permutation
$s$ of $1, 2, \cdots, f$. This coefficient $F(i_1 i_2 \cdots i_f)$ is the component
$x(s)$ of the element

$$x = F(1, 2, \cdots, f)$$

of the algebra $(\pi)$. The separation of the term (8.4) is to a first
approximation determined by the reduction of the correspondence

$$F(i_1 i_2 \cdots i_f) \to F'(i_1 i_2 \cdots i_f) = \sum_{(k)} a(i_1 i_2 \cdots i_f;\ k_1 k_2 \cdots k_f)\, F(k_1 k_2 \cdots k_f)$$

to diagonal form; here the matrix of the coefficients $a$ represents
the energy and $i_1, i_2, \cdots, i_f$; $k_1, k_2, \cdots, k_f$ are permutations
$s$, $t$ of $1, 2, \cdots, f$. This equation may therefore be written in
the form

$$x'(s) = \sum_t a(s, t)\, x(t). \tag{8.5}$$

The equation

$$a(i_{1'} \cdots i_{f'};\ k_{1'} \cdots k_{f'}) = a(i_1 \cdots i_f;\ k_1 \cdots k_f)$$

describing the symmetry of $a$, in which

$$1 \to 1',\ 2 \to 2',\ \cdots,\ f \to f'$$

is any fixed permutation $r$, is expressed by

$$a(sr, tr) = a(s, t)$$

for the only coefficients in which we are here interested; $r$ is here
considered as applied to the indices $1, 2, \cdots, f$ themselves rather
than the sub-indices. Hence $a(s, t)$ depends only on $st^{-1}$:

$$a(s, t) = a(st^{-1}),$$

and equation (8.5) may now be written in the abbreviated form

$$(a):\quad x \to x' = ax \tag{8.6}$$

where $a$, $x$, $x'$ are the symmetry elements of the algebra $(\pi)$ with
components $a(s)$, $x(s)$, $x'(s)$.

On restricting ourselves to an invariant irreducible sub-space
$\mathfrak{P}$ of the system space the element $x$ of $(\pi)$ lies in the
corresponding $\mathfrak{p}$. The $g$ terms into which (8.4) is
resolved by the perturbation and which belong to the term
system $\chi$ under consideration are, to the approximation involved
in the perturbation theory, the characteristic numbers of the
correspondence (8.6) of $\mathfrak{p}$ on itself. The sum of these terms must
therefore equal the trace of this correspondence, or

$$\sum_s a(s)\chi(s). \tag{8.7}$$

The sums of the squares of these terms, of their third powers,
etc., are obtained by reiterating the correspondence $(a)$, i.e.

$$W_1^\tau + W_2^\tau + \cdots + W_g^\tau = \sum_s a_\tau(s)\chi(s), \tag{8.7'}$$

where the $a_\tau(s)$ are the components of the symmetry element
$a^\tau$:

$$a_0(s) = 1 \text{ or } 0, \text{ according as } s = I \text{ or } s \neq I; \qquad a_{\tau+1}(s) = \sum_t a_\tau(st^{-1})\, a(t). \tag{8.8}$$

As soon as the "exchange energies" $a(s)$ are known we can
apply this formula to calculate those of the terms arising from
(8.4) which are contained in the term system $\chi$; for this we need
only to know the character $\chi$; it is not necessary to have an
explicit expression for the idempotent generator $e$ or the
representation $\mathfrak{h}$ of $\pi$.
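The recipe above can be tried numerically. The following is a minimal sketch, assuming $f = 3$, assuming the reading $a_{\tau+1}(s) = \sum_t a_\tau(st^{-1})a(t)$ of the composition rule and $\sum_i W_i^\tau = \sum_s a_\tau(s)\chi(s)$ of (8.7'), and using the standard two-dimensional character of the symmetric group on three things (number of fixed points minus one). The function names and the sample exchange energies are purely illustrative and do not come from the text.

```python
from itertools import permutations

F = 3
GROUP = list(permutations(range(F)))  # s is stored as the tuple of images i -> s[i]


def compose(s, t):
    """Product st: apply t first, then s."""
    return tuple(s[t[i]] for i in range(F))


def inverse(s):
    inv = [0] * F
    for i, j in enumerate(s):
        inv[j] = i
    return tuple(inv)


def convolve(a, b):
    """Group-algebra product: c(s) = sum over t of a(s t^-1) b(t)."""
    return {s: sum(a[compose(s, inverse(t))] * b[t] for t in GROUP) for s in GROUP}


def chi_mixed(s):
    """Character of the 2-dimensional representation: fixed points minus 1."""
    return sum(1 for i in range(F) if s[i] == i) - 1


def power_sums(a, chi, count):
    """Return [sum W, sum W^2, ...] with `count` entries, built from a, a^2, ..."""
    sums, a_tau = [], dict(a)
    for _ in range(count):
        sums.append(sum(a_tau[s] * chi(s) for s in GROUP))
        a_tau = convolve(a_tau, a)
    return sums


def levels_from_power_sums(p1, p2):
    """Two levels W1 <= W2 from W1 + W2 = p1 and W1^2 + W2^2 = p2."""
    e2 = (p1 * p1 - p2) / 2           # product W1 * W2
    disc = (p1 * p1 - 4 * e2) ** 0.5  # assumes real levels
    return ((p1 - disc) / 2, (p1 + disc) / 2)
```

With exchange energies 1, 1, 3 on the three transpositions and zero elsewhere, `power_sums` gives the two sums from the character alone, and `levels_from_power_sums` turns them into the two levels; no explicit representation matrices are needed.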

These considerations are immediately applicable only if we 
ignore the spin phenomena. If we take into account the per- 
turbation due to the interaction of the electrons before that 
due to the spin, as in the case of normal term order, the mere 
existence of spin implies that each of the energies Ei is at least 
two-fold. We shall later concern ourselves with the far-reaching 
modifications caused by the spin and by the Pauli exclusion 
principle, which enables us to discard the majority of possible 
terms. 

The unperturbed $I^f$ will have, in addition to terms of the
type (8.4), terms in which groups of two or more summands
appear with the same indices. The multiplicity of the term

$$f_1 E_1 + f_2 E_2 + \cdots + f_n E_n \qquad (f_1 + f_2 + \cdots + f_n = f) \tag{8.9}$$

with integral non-negative weights $f_i$ is but

$$\frac{f!}{f_1!\, f_2! \cdots f_n!}. \tag{8.10}$$

The corresponding tensor coefficients $x(s)$ are those obtained
from

$$F(\underbrace{1\,1 \cdots 1}_{f_1};\ \underbrace{2\,2 \cdots 2}_{f_2};\ \cdots)$$

by the permutations $s$ of the $f$ arguments. But a permutation
$p$ is without effect if it only permutes the first $f_1$ indices among
themselves, the next $f_2$ among themselves, etc.; we may no
longer distinguish between the permutations $s$ and $ps$; they
must be considered as giving rise to but one component. Such
permutations $p$ constitute a group $\pi' = \pi(f_1, f_2, \cdots)$ of order
$h' = f_1!\, f_2! \cdots$, and two permutations $s$, $t$ are to be considered
as the same if they are left-equivalent with respect to this
sub-group $\pi'$, i.e. if $s \equiv t$ ($ps = t$ where $p$ is an element of $\pi'$). The
only elements $x$ of the algebra $(\pi)$ in which we are now interested
are those which satisfy the equation

$$x(t) = x(s) \quad \text{when } t \equiv s \pmod{\pi'};$$

they constitute a linear sub-space $\mathfrak{r}' = \mathfrak{r}(\pi')$ of dimensionality
(8.10). More precisely, $\mathfrak{r}'$ is a right-invariant sub-algebra, for
if $s \equiv t$ then also $sr \equiv tr$. Again $a(s, t) = a(st^{-1})$; further

$$a(ps) = a(s), \qquad a(sp) = a(s)$$

if $p$ is in $\pi'$.
We are now concerned with the correspondence $x \to x'$ in $\mathfrak{r}'$:

$$x'(s) = \sum_t a(st^{-1})\, x(t) \pmod{\pi'}, \tag{8.11}$$

where the "mod $\pi'$" indicates that both $s$ and $t$ run through
a complete set of elements of the group which are inequivalent
mod $\pi'$. As $x$ runs through $\mathfrak{r}'$, $xe$ generates a sub-space $\mathfrak{p}'$ of $\mathfrak{r}'$
which is transformed into itself by the correspondence (8.11),
and the reduction of this correspondence of $\mathfrak{p}'$ into diagonal
form yields those terms arising from (8.9) and lying in the term
system $\chi$. The trace of (8.11) in $\mathfrak{p}'$ is equal to the trace of the
correspondence $A_e$: $x \to x'$ in $\mathfrak{r}'$ which is obtained from (8.11)
by replacing $x$ by $xe$, i.e. $x(t)$ by

$$\sum_r x(tr^{-1})\, e(r) = \sum_r x(r)\, e(r^{-1}t).$$

Hence

$$\operatorname{tr}(A_e) = \sum_{s,\,t \bmod \pi'} \Bigl\{ a(st^{-1}) \sum_{r \equiv s} e(r^{-1}t) \Bigr\}.$$

Since $a(st^{-1}) = a(rt^{-1})$ when $r \equiv s$ (mod $\pi'$), this trace may be
written

$$\sum_{t \bmod \pi'} \sum_r a(rt^{-1})\, e(r^{-1}t).$$

Naturally this sum does not depend on which particular element
$t$ we have happened to choose from the set of group elements
which are equivalent mod $\pi'$; hence on dropping the restriction
on the range of $t$ the above sum is multiplied by the order $h'$
of $\pi'$:

$$\operatorname{tr}(A_e) = \frac{1}{h'} \sum_{r,\,t} a(rt^{-1})\, e(r^{-1}t) = \frac{1}{h'} \sum_s a(s)\chi(s). \tag{8.12}$$

Here again $\chi(s)$ is the character of $\mathfrak{h}$ as determined by (8.3).
In particular, the dimensionality of $\mathfrak{p}'$, i.e. the number of terms
in the system $\chi$ arising from (8.9), is obtained by replacing the
symmetry element $a$ in (8.12) by the element $a_0$ defined by

$$a_0(s) = 1 \text{ or } 0, \text{ according as } s \equiv I \pmod{\pi'} \text{ or not};$$

this number is consequently

$$\frac{1}{h'} \sum_{s \text{ in } \pi'} \chi(s). \tag{8.13}$$

We express this result, the validity of which is not restricted 
to permutation groups, in the theorem : 

Let $\pi'$ be a sub-group of $\pi$ of order $h'$ and let $\mathfrak{p}$ be a left-invariant
sub-space of the group space $\mathfrak{r}$ of $\pi$. Consider the elements $x$ of
the algebra $(\pi)$ which satisfy the condition $x(s) = x(t)$, where $s$ and
$t$ are any two elements of the group $\pi$ which are left-equivalent
mod $\pi'$; the elements of $(\pi)$ which are of this type and which
lie in $\mathfrak{p}$ constitute a linear sub-space whose dimensionality is given
by (8.13), where $\chi$ is the character of the regular representation in $\mathfrak{p}$.
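As an illustration of the counting rule, here is a minimal sketch for three individuals, assuming that (8.13) reads $\frac{1}{h'}\sum_{s \in \pi'} \chi(s)$ and using the three standard primitive characters of the symmetric group on three things. The helper names (`block_subgroup`, `term_count`) are hypothetical; the sub-group built here is $\pi(f_1, f_2, \cdots)$, which permutes each consecutive block of numerals within itself.

```python
from itertools import permutations


def fixed_points(s):
    return sum(1 for i, j in enumerate(s) if i == j)


def sign(s):
    """Parity of s, computed from the number of inversions."""
    n = len(s)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n) if s[i] > s[j])
    return -1 if inversions % 2 else 1


# Primitive characters of the symmetric group on 3 things (standard facts,
# supplied here as an assumption rather than taken from the text).
CHARACTERS_S3 = {
    "symmetric": lambda s: 1,
    "mixed": lambda s: fixed_points(s) - 1,
    "antisymmetric": sign,
}
DIMENSIONS_S3 = {"symmetric": 1, "mixed": 2, "antisymmetric": 1}


def block_subgroup(weights):
    """Permutations of 0..f-1 that map each consecutive block of size f_i into itself."""
    blocks, start = [], 0
    for w in weights:
        blocks.append(range(start, start + w))
        start += w
    return [s for s in permutations(range(start))
            if all(s[i] in block for block in blocks for i in block)]


def term_count(chi, weights):
    """Number of terms of the system chi arising from the level with the given weights."""
    subgroup = block_subgroup(weights)
    return sum(chi(s) for s in subgroup) // len(subgroup)
```

For weights $(2, 1)$ the counts are 1, 1, 0 for the symmetric, mixed and antisymmetric systems; weighting each by the dimension of its representation gives $1 + 2 + 0 = 3 = 3!/(2!\,1!)$, in agreement with the multiplicity (8.10).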

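The sum of the terms is equal to the trace (8.12), and the
sums of their powers are given by

$$\sum W^\tau = \frac{1}{f_1!\, f_2! \cdots} \sum_s a_\tau(s)\chi(s). \tag{8.14}$$

The only way this result differs from (8.7') is by the introduction
of the denominator $f_1!\, f_2! \cdots$ and the fact that $a_\tau(s)$ is now defined
by

$$a_{\tau+1}(s) = \sum_t a_\tau(st^{-1})\, a(t) \pmod{\pi'}.$$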

Degenerate Case. Denote the numerically different energy
levels of the individual $I$ by $E^{(1)}, E^{(2)}, \cdots$, and the multiplicity
of $E^{(\nu)}$ by $n_\nu$. We now distinguish between the various variables
having the same "principal quantum number" $\nu$ by an "auxiliary
quantum number" $k_\nu$ which assumes $n_\nu$ values. An energy
level of the type

$$E' + E'' + \cdots + E^{(f)} \tag{8.15}$$

of the unperturbed total system $I^f$ has the multiplicity

$$f!\; n_1 n_2 \cdots n_f,$$

and the corresponding tensor coefficients are those obtained
from those of type

$$F\begin{pmatrix} 1 & 2 & \cdots & f \\ k_1 & k_2 & \cdots & k_f \end{pmatrix}$$

by any permutation $s$ of the $f$ pairs $(i \mid k_i)$ of arguments; we write
instead

$$x(s \mid k_1 k_2 \cdots k_f) \quad \text{or briefly} \quad x(s \mid k).$$

Similarly the coefficients of the energy matrix are denoted by

$$a(s \mid k_1 \cdots k_f;\ t \mid l_1 \cdots l_f) = a(st^{-1} \mid k;\, l).$$

The energy levels $W$ arising from (8.15) by the perturbation
and lying in the term system $\chi$ are, to a first approximation,
determined by

$$\sum W^\tau = \sum_{(k)} \sum_s a_\tau(s \mid k;\, k)\, \chi(s), \tag{8.16}$$

where $a_0(s \mid k;\, l) = 1$ or 0 according as $s = I$, $k = l$ or not, and
the composition is defined by

$$a_{\tau+1}(s \mid k;\, l) = \sum_{t,\,(m)} a_\tau(st^{-1} \mid k;\, m)\, a(t \mid m;\, l). \tag{8.17}$$

If the unperturbed energy level is of the form

$$f' E' + f'' E'' + \cdots \qquad (f' + f'' + \cdots = f)$$

the tensor coefficients in which we are interested are those
obtained from

$$F\begin{pmatrix} 1 & 1 & \cdots & 2 & 2 & \cdots \\ k_{11} & k_{12} & \cdots & k_{21} & k_{22} & \cdots \end{pmatrix}$$

in which the principal quantum number 1 appears $f'$ times, 2
appears $f''$ times, etc.
Let exactly $f'_1$ of the auxiliary quantum numbers $k_{1\rho}$ ($\rho = 1,
\cdots, f'$) have a certain value, $f'_2$ a different value, etc.;
$f'_1 + f'_2 + \cdots = f'$. Let $f''_1, f''_2, \cdots$ have the analogous
meaning for the quantum numbers $k_{2\rho}$ ($\rho = 1, \cdots, f''$) associated
with the principal quantum number 2, etc. Then those
permutations $p$ which leave the above tensor coefficient unchanged
constitute a certain sub-group $\pi'_k$, depending on the distribution
of auxiliary quantum numbers, of the group $\pi'$ introduced in
the non-degenerate case above; the order of $\pi'_k$ is
$[k] = f'_1!\, f'_2! \cdots f''_1! \cdots$. The coefficient $a(s \mid k;\, l)$ is unchanged when $s$ is multiplied on the
left by an element of $\pi'_k$ and on the right by an element of $\pi'_l$.
The formula (8.16) now becomes

$$\sum W^\tau = \sum_{(k)} \Bigl\{ \frac{1}{[k]} \sum_s a_\tau(s \mid k;\, k)\, \chi(s) \Bigr\}; \tag{8.18}$$

$a_0(s \mid k;\, l) = 1$ or 0 according as $k = l$ and $s \equiv I$ (mod $\pi'_k$)
or not, and in the composition rule (8.17) we first sum with
respect to $t$ mod $\pi'_m$ and then over the various possibilities

$$m = (m_{11}, m_{12}, \cdots;\ m_{21}, \cdots;\ \cdots).$$

In every case we obtain explicit expressions for the sums of
the various powers of the perturbed energy levels in terms of
the character $\chi$ of the term system under consideration and the
exchange energies $a(s)$.

§ 9. Relation between the Characters of the Symmetric
Permutation and Affine Groups

The thorough correspondence existing between the
representations of the symmetric permutation group $\pi_f$ and the
representations of order $f$ of the linear group $\mathfrak{c} = \mathfrak{c}_n$ must lead to
a simple relation between the corresponding characters. In
dealing with the linear group it suffices to consider only the
"principal transformations"

$$x_i \to \varepsilon_i x_i \qquad (i = 1, 2, \cdots, n) \tag{9.1}$$

of the vector space $\mathfrak{R}_n$, for any linear transformation is
conjugate within $\mathfrak{c}$ to a principal transformation, except for
those cases in which two or more of the characteristic numbers
$\varepsilon_i$ coincide. Furthermore, if we restrict ourselves ab initio to
the unitary group $\mathfrak{u}$ (the one in which we are interested in
physics) the result is valid without exception and the $\varepsilon_i$ are
complex numbers of unit absolute value. The problem here
proposed is identical with that of investigating the distribution
of the terms of $I^f$ among the various term systems $\chi$ in the
absence of interaction between the various individuals and when
the single system $I$ is non-degenerate, for on choosing a Heisenberg
co-ordinate system $x_i$ in the system space of $I$ (i.e. one in
which the operator representing the energy of $I$ is in diagonal
form) the variable $x_i$ assumes a multiplicative factor $\varepsilon_i$ of unit
absolute value, fixed by the energy level $E_i$, in time $t$.

We denote the characteristic * of the representation of
the linear group whose substratum consists of all tensors of the
form $eF$ by $X(S)$ or $X(\varepsilon_1, \varepsilon_2, \cdots, \varepsilon_n)$ where the element $S$ of $\mathfrak{c}$ is
the principal transformation (9.1). The $\varepsilon_i$ are to be considered
as $n$ independent variables. The transformation of tensor space
associated with (9.1) consists in multiplying the coefficient
$F(i_1, i_2, \cdots, i_f)$ of the tensor $F$ by $\varepsilon_{i_1} \varepsilon_{i_2} \cdots \varepsilon_{i_f}$. The sum of
all these multipliers, extended over all linearly independent
coefficients of a general tensor of the form $F' = eF$, is the desired
characteristic. A component in which $f_1$ of the arguments $i$ are
equal to 1, $f_2$ are equal to 2, $\cdots$ is multiplied by
$\varepsilon_1^{f_1} \varepsilon_2^{f_2} \cdots \varepsilon_n^{f_n}$.
But the number of linearly independent components of $F'$ of
this type is, by equation (8.13),

$$\frac{1}{f_1!\, f_2! \cdots} \sum_s \chi(s); \tag{9.2}$$

here $\chi$ is the character of the representation $\mathfrak{h}$ of $\pi_f$, the sum
being extended over all elements $s$ of the group $\pi' = \pi(f_1, f_2, \cdots)$
which permutes the first $f_1$ numerals among themselves, the next
$f_2$ among themselves, etc. That this number (9.2) depends only
on the character $\chi$ is a fact of greatest importance for our present
considerations.

* We prefer, for the sake of clarity, hereafter to employ the word
"characteristic" for continuous and "character" for finite groups.

The result is

$$X(\varepsilon_1, \varepsilon_2, \cdots, \varepsilon_n) = \sum_{f_1, f_2, \cdots} \frac{\varepsilon_1^{f_1} \varepsilon_2^{f_2} \cdots \varepsilon_n^{f_n}}{f_1!\, f_2! \cdots f_n!} \sum_s \chi(s) \tag{9.3}$$

where the inner sum is extended over all the elements $s$ of
$\pi(f_1, f_2, \cdots)$. We denote the value of the character $\chi$ for an
element $s$ belonging to the class $\mathfrak{k}$ of conjugate elements of
$\pi_f$ by $\chi(\mathfrak{k})$; our formula may then be written

$$X = \sum_{\mathfrak{k}} \Bigl\{ \chi(\mathfrak{k}) \sum_{f_1, f_2, \cdots} n_{f_1 f_2 \cdots}(\mathfrak{k})\, \frac{\varepsilon_1^{f_1} \varepsilon_2^{f_2} \cdots \varepsilon_n^{f_n}}{f_1!\, f_2! \cdots f_n!} \Bigr\} \tag{9.4}$$

where $n_{f_1 f_2 \cdots}(\mathfrak{k})$ is the number of elements of $\pi(f_1, f_2, \cdots)$
belonging to the class $\mathfrak{k}$. This number can be evaluated in an
elementary manner.


Distribution of Permutations in Classes.

Any permutation $s$ is a product of cycles, no two of which
contain a common numeral. The 5-term cycle (1 3 7 2 4) is
a permutation which sends 1 into 3, 3 into 7, 7 into 2, 2 into 4,
and 4 into 1 again; writing these 5 numerals at equidistant
intervals on the rim of a wheel, this permutation may be
considered as the rotation of the wheel through the angle $2\pi/5$. Given
any permutation, for example

$$\begin{pmatrix} 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 \\ 3 & 4 & 7 & 1 & 9 & 8 & 2 & 6 & 5 \end{pmatrix}, \tag{9.5}$$

the cycles may be separated out by first determining the number
(3) into which 1 is transformed, then the number (7) into which
3 is transformed, etc., until a number is obtained which has
already appeared in the cycle; this number can, of course,
only be 1. After separating out the first cycle the remaining
numbers can be handled in the same way, and the process may
be continued until the desired result is obtained. The
permutation (9.5) is, in terms of its 3 cycles,

(1 3 7 2 4) (5 9) (6 8). (9.6)
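The separation into cycles just described is easily mechanized. The following is a minimal sketch; the function names `cycles` and `cycle_type` are illustrative only, and a permutation is supplied as the list of images of $1, 2, \cdots, f$, i.e. the lower row of (9.5).

```python
def cycles(images):
    """Split a permutation into cycles.

    images[i - 1] is the numeral into which i is sent.
    Returns the cycles as tuples, each starting with its smallest numeral.
    """
    seen = set()
    result = []
    for start in range(1, len(images) + 1):
        if start in seen:
            continue
        cycle = []
        current = start
        # Follow start -> image -> image of image ... until we return to start.
        while current not in seen:
            seen.add(current)
            cycle.append(current)
            current = images[current - 1]
        result.append(tuple(cycle))
    return result


def cycle_type(images):
    """Return (i_1, i_2, ..., i_f): the number of cycles of each length."""
    lengths = [len(c) for c in cycles(images)]
    return tuple(lengths.count(k) for k in range(1, len(images) + 1))
```

Applied to the lower row `[3, 4, 7, 1, 9, 8, 2, 6, 5]` of (9.5), `cycles` returns the three cycles of (9.6), and `cycle_type` records two 2-term cycles and one 5-term cycle.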

The reduction of an arbitrary permutation into its cycles is
obviously unique. This way of writing the permutation enables
us to tell at a glance whether two given permutations are
conjugate in $\pi_f$ or not, for an element conjugate to (9.6) is obtained
by replacing the numbers 1, 2, 3, 4, $\cdots$ by the same numbers
in any order. The class $\mathfrak{k}$ to which an element $s$ belongs is thus
determined entirely by the number of cycles and the number
of integers they contain; in particular, any permutation $s$ and
its inverse belong to the same class. We denote the class
$\mathfrak{k}$ whose elements $s$ consist of $i_1$ cycles with one numeral, $i_2$ with
two, $i_3$ with three, $\cdots$ by $(i_1 i_2 \cdots)$ and write $\chi(\mathfrak{k}) = \chi(i_1 i_2 \cdots)$;
naturally

$$1 i_1 + 2 i_2 + 3 i_3 + \cdots = f. \tag{9.7}$$

The number $K$ of classes is the number of solutions of (9.7) with
non-negative integers $i_1, i_2, i_3, \cdots$.

The number of elements in the class $\mathfrak{k} = (i_1 i_2 i_3 \cdots)$ is

$$n(\mathfrak{k}) = \frac{f!}{1^{i_1} i_1!\; 2^{i_2} i_2!\; 3^{i_3} i_3! \cdots}. \tag{9.8}$$

To show this we write the $f$ integers $1, 2, \cdots, f$ in any of the
$f!$ possible orders and divide off each of the first $i_1$ integers by
parentheses, then divide off the next $2 i_2$ in groups of 2, the next
$3 i_3$ in groups of 3, $\cdots$. The symbol so obtained is to be
interpreted as the expression of a permutation in terms of its cycles.
Each of the $f!$ possible arrangements so obtained leads to a definite
element $s$ of the class $\mathfrak{k}$, and all such elements must be included.
We must now investigate how often the same $s$ occurs among these
$f!$. Now the 5-term cycle (1 3 7 2 4) can also be read as (3 7 2 4 1),
(7 2 4 1 3), etc.: the particular integer with which we begin is
immaterial; such a cycle will occur five times. Hence those
$1^{i_1} 2^{i_2} 3^{i_3} \cdots$ arrangements which differ only by a cyclic
permutation of the numerals in each cycle are all associated with
the same element $s$. Furthermore, the $i_1$ 1-term cycles may be
written down in any order, the $i_2$ 2-term ones in any order, etc.,
and these $i_1!\, i_2! \cdots$ arrangements all lead to the same element $s$.
Hence each element occurs exactly $1^{i_1} i_1!\; 2^{i_2} i_2! \cdots$ times, and the
total number of elements in the class is accordingly given by
(9.8).
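The counting argument can be checked by brute force for small $f$. The sketch below assumes the reading $n(\mathfrak{k}) = f!/(1^{i_1} i_1!\, 2^{i_2} i_2! \cdots)$ of (9.8); the function names are illustrative, and permutations act on $0, \cdots, f-1$ for convenience.

```python
from collections import Counter
from itertools import permutations
from math import factorial


def cycle_type(s):
    """Return (i_1, ..., i_f) for a permutation s of 0..f-1 given as a tuple of images."""
    f = len(s)
    seen = [False] * f
    counts = [0] * f
    for start in range(f):
        if seen[start]:
            continue
        length, current = 0, start
        while not seen[current]:
            seen[current] = True
            current = s[current]
            length += 1
        counts[length - 1] += 1
    return tuple(counts)


def class_size(ctype):
    """Closed formula: f! divided by the product of k^{i_k} * i_k! over all cycle lengths k."""
    f = sum(k * i for k, i in enumerate(ctype, start=1))
    denominator = 1
    for k, i in enumerate(ctype, start=1):
        denominator *= k ** i * factorial(i)
    return factorial(f) // denominator


def class_sizes_by_enumeration(f):
    """Count the permutations of each cycle type directly."""
    return Counter(cycle_type(s) for s in permutations(range(f)))
```

For $f = 5$ the enumeration finds seven classes, the number of solutions of (9.7), and each class has exactly the size predicted by the closed formula.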


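We must also determine the number of elements of $\mathfrak{k}$ which
are contained in the sub-group $\pi(f_1, f_2, \cdots)$. For this purpose
we divide the numbers from 1 to $f$ in sections of lengths $f_1,
f_2, \cdots$ and consider only those permutations $s$ which permute
the numbers of the first section among themselves, the numbers
of the second among themselves, etc. On dividing $s$ into cycles
as in the above some of the cycles will be contained in the first
section, i.e. will consist only of numerals belonging to the first
section, some will be contained in the second section, etc., and
no cycle will consist of numerals belonging to different sections.
Denoting the number of 1-term cycles contained in the first
section by $i_{11}$, the number of 2-term cycles in this section by
$i_{12}$, etc., whence necessarily

$$1 i_{11} + 2 i_{12} + 3 i_{13} + \cdots = f_1,$$

the number of permutations of $1, 2, \cdots, f_1$ satisfying this
requirement is, by (9.8),

$$\frac{f_1!}{1^{i_{11}} i_{11}!\; 2^{i_{12}} i_{12}! \cdots}. \tag{9.9}$$

Proceeding analogously for the 2nd, 3rd, etc., sections, the number
of permutations in $\pi(f_1, f_2, \cdots)$ satisfying all our requirements is
given by the product of all numbers of the form (9.9) for the
various sections. But such an element is a member of the
class $\mathfrak{k} = (i_1 i_2 \cdots)$ if and only if

$$\sum_\alpha i_{\alpha 1} = i_1, \qquad \sum_\alpha i_{\alpha 2} = i_2, \qquad \cdots; \tag{9.10}$$

hence

$$n_{f_1 f_2 \cdots}(\mathfrak{k}) = \sum_{(i)} \prod_\alpha \frac{f_\alpha!}{1^{i_{\alpha 1}} i_{\alpha 1}!\; 2^{i_{\alpha 2}} i_{\alpha 2}! \cdots}$$

where the sum is extended over the various solutions of equations
(9.10) and

$$1 i_{\alpha 1} + 2 i_{\alpha 2} + \cdots = f_\alpha.$$

The inner sum in (9.4) is accordingly

$$\frac{1}{1^{i_1} 2^{i_2} \cdots} \sum_{(i)} \prod_\alpha \frac{\varepsilon_\alpha^{f_\alpha}}{i_{\alpha 1}!\, i_{\alpha 2}! \cdots},$$

the only restriction on the sum being the conditions (9.10). Let

$$\sigma_1 = \varepsilon_1 + \varepsilon_2 + \cdots + \varepsilon_n, \qquad \sigma_2 = \varepsilon_1^2 + \varepsilon_2^2 + \cdots + \varepsilon_n^2, \qquad \cdots.$$

Our results can be expressed entirely in terms of these sums of
powers, for by the multinomial theorem

$$\frac{\sigma_1^{i_1} \sigma_2^{i_2} \cdots}{i_1!\, i_2! \cdots} = \sum \prod_\alpha \frac{\varepsilon_\alpha^{\,i_{\alpha 1} + 2 i_{\alpha 2} + \cdots}}{i_{\alpha 1}!\, i_{\alpha 2}! \cdots}$$

where the variables $i_{\alpha 1}, i_{\alpha 2}, \cdots$, over which the sum is extended,
are subject to the restrictions (9.10). We thus finally obtain
the simple formula

$$X(\varepsilon_1, \cdots, \varepsilon_n) = \sum_{(i_1 i_2 \cdots)} \chi(i_1 i_2 \cdots)\, \frac{\sigma_1^{i_1} \sigma_2^{i_2} \cdots}{1^{i_1} i_1!\; 2^{i_2} i_2! \cdots}. \tag{9.11}$$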


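We have so far made use only of the elementary connection
between the groups $\pi$ and $\mathfrak{c}$. If we now introduce the assumption
that the number field over which our algebras are defined is
algebraically closed, and is in particular the continuum of all
complex numbers, the primitive characters of the finite group $\pi$
have the orthogonality properties

$$\frac{1}{h} \sum_{\mathfrak{k}} n(\mathfrak{k})\, \chi(\mathfrak{k})\, \chi(\mathfrak{k}^{-1}) = 1, \qquad \sum_{\mathfrak{k}} n(\mathfrak{k})\, \chi(\mathfrak{k})\, \chi'(\mathfrak{k}^{-1}) = 0 \quad (\chi \neq \chi').$$

Furthermore, the number of primitive characters is equal to
the number $K$ of classes. The above relations assert that the
matrix of the $\chi(\mathfrak{k})$, where $\chi$ runs through the entire set of primitive
characters and $\mathfrak{k}$ all classes, has as its reciprocal the matrix

$$\frac{1}{h}\, n(\mathfrak{k})\, \chi(\mathfrak{k}^{-1}).$$

Hence we also have

$$\frac{n(\mathfrak{k})}{h} \sum_\chi \chi(\mathfrak{k})\, \chi(\mathfrak{k}^{-1}) = 1, \qquad \sum_\chi \chi(\mathfrak{k}')\, \chi(\mathfrak{k}^{-1}) = 0 \quad \text{for } \mathfrak{k}' \neq \mathfrak{k}.$$

This is, in fact, merely an alternative form of the completeness
theorem. In dealing with the symmetric permutation group $\pi_f$,
$\mathfrak{k}^{-1} = \mathfrak{k}$ and the order is $h = f!$.

On multiplying the expression (9.11) for the primitive
characteristic $X$ by $\chi(i_1 i_2 \cdots)$ and summing over all the primitive
characters $\chi$ of $\pi_f$ we obtain, with the aid of the relations
derived above, the important formula

$$\sigma_1^{i_1} \sigma_2^{i_2} \cdots = \sum_\chi \chi(i_1 i_2 \cdots)\, X(\varepsilon_1, \varepsilon_2, \cdots, \varepsilon_n) \tag{9.12}$$

where $\chi$ and $X$ are the characters of corresponding irreducible
representations of $\pi_f$ and $\mathfrak{c}_n$.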
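The two formulas can be verified numerically in the smallest interesting case. The sketch below assumes $f = 3$, $n = 3$, the standard character table of the symmetric group on three things, and the readings $X = \sum \chi(i_1 i_2 \cdots)\, \sigma_1^{i_1}\sigma_2^{i_2}\cdots / (1^{i_1} i_1!\, 2^{i_2} i_2! \cdots)$ of (9.11) and $\sigma_1^{i_1}\sigma_2^{i_2}\cdots = \sum_\chi \chi(i_1 i_2 \cdots) X$ of (9.12). All names are illustrative.

```python
from fractions import Fraction
from math import factorial

# Classes of the symmetric group on 3 things as cycle types (i_1, i_2, i_3).
CLASSES = [(3, 0, 0), (1, 1, 0), (0, 0, 1)]

# Primitive characters on those classes (standard table, assumed here).
CHARACTERS = {
    "symmetric": (1, 1, 1),
    "mixed": (2, 0, -1),
    "antisymmetric": (1, -1, 1),
}


def power_sum(eps, k):
    """sigma_k = eps_1^k + ... + eps_n^k."""
    return sum(e ** k for e in eps)


def class_weight(ctype):
    """Denominator 1^{i_1} i_1! 2^{i_2} i_2! ... of the class-size formula."""
    weight = 1
    for k, i in enumerate(ctype, start=1):
        weight *= k ** i * factorial(i)
    return weight


def sigma_product(eps, ctype):
    """sigma_1^{i_1} sigma_2^{i_2} ... for the given class."""
    result = 1
    for k, i in enumerate(ctype, start=1):
        result *= power_sum(eps, k) ** i
    return result


def characteristic(chi, eps):
    """Characteristic X(eps) belonging to the character chi, via the power-sum formula."""
    return sum(
        (Fraction(value * sigma_product(eps, ctype), class_weight(ctype))
         for value, ctype in zip(chi, CLASSES)),
        Fraction(0),
    )


def reciprocity_holds(eps):
    """Check sigma-product = sum over chi of chi(class) * X_chi(eps) for every class."""
    chars = {name: characteristic(chi, eps) for name, chi in CHARACTERS.items()}
    return all(
        sigma_product(eps, ctype)
        == sum(CHARACTERS[name][idx] * chars[name] for name in CHARACTERS)
        for idx, ctype in enumerate(CLASSES)
    )
```

Setting all $\varepsilon_i = 1$ in `characteristic` yields the dimensions of the three representations of the linear group in three variables on tensors of order 3, namely 10, 8 and 1; their sum weighted by the dimensions 1, 2, 1 of the permutation representations is $10 + 16 + 1 = 27 = 3^3$.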
§ 10. Direct Product. Sub-groups

Programme.

If two atoms or ions with $f_1$, $f_2$ electrons, respectively, come
together to form a molecule we may to a first approximation
neglect the interaction between the two atoms so long as the
distance between them is relatively large. In this approximation
the two kinds of electrons are dynamically different, for the
electrons of each atom are influenced only by the nucleus and
the remaining electrons of the same atom. The symmetry is
therefore described by the sub-group $\pi'$ of the symmetric group
$\pi = \pi_f$ of $f = f_1 + f_2$ things in which the first $f_1$ and the last $f_2$
things are permuted among themselves. A similar situation
arises when three or more atoms come together to form a
molecule. These considerations immediately suggest the following
problems.

I. The theory developed in §§ 2-4 is to be extended to the
case in which the symmetric permutation group is replaced
by any permutation group $\pi'$. Naturally the definition of a
symmetric transformation in tensor space is to be adapted to
the new situation: we require only that the coefficients
$a(i_1 \cdots i_f;\ k_1 \cdots k_f)$ of (1.2) remain unchanged under an
arbitrary permutation belonging to the group $\pi'$ of the sub-indices
$1, 2, \cdots, f$. We say that these transformations are symmetric
with respect to $\pi'$; they constitute an algebra $\Sigma'$ which is
obviously more extensive than $\Sigma$. This question is immediately
settled by the remark that all our previous deductions are valid
for an arbitrary permutation group $\pi'$. Here $\pi'$ is considered as
an independent group rather than as a sub-group of the
symmetric group.

II. Let the set of integers from 1 to $f$ be divided into two
or more sub-sets. We consider, as an example, the case of
two sub-sets: the "red" numerals from 1 to $f_1$, the "green"
ones from 1 to $f_2$; $f_1 + f_2 = f$. Let $\pi'$ consist of all permutations
of the red among themselves and the green among themselves.
Hence a permutation $s' = (s_1, s_2)$ of $\pi'$ consists of a permutation
$s_1$ of the $f_1$ red numerals and a permutation $s_2$ of the green ones;
$\pi'$ is the direct product $\pi_1 \times \pi_2$ of the symmetric group $\pi_1$ of $f_1$
and $\pi_2$ of $f_2$ things. Or conversely, this direct product (the
abstract definition of which has nothing to do with the group
of permutations of $f$ things) may be considered as a sub-group
$\pi'$ of the symmetric group of $f = f_1 + f_2$ things on arranging
the sets of numerals, on which permutations of $\pi_1$, $\pi_2$ act, one
after the other to form a single set. But here we are interested
in the following problem (which can be proposed for arbitrary
finite groups): to discuss the properties of a group $\pi_1 \times \pi_2$
which is the direct product of two finite groups $\pi_1$, $\pi_2$.

III. In order to discuss the structure of molecules we must
eventually take into account the interaction between the various
atoms or ions contained in the molecule. This means that we
must finally return from the sub-group $\pi'$ to the full symmetric
group $\pi$, so we must examine the relations existing between the
group $\pi$ and its sub-group $\pi'$. Here again the problem is not
restricted to permutation groups.

Direct Product.

Let $\pi_1$, $\pi_2$ be two finite groups of orders $h_1$, $h_2$ respectively.
The elements of the direct product $\pi = \pi_1 \times \pi_2$ are the pairs
$(s_1, s_2)$ consisting of an element $s_1$ of $\pi_1$ and an element $s_2$ of
$\pi_2$. An element of the algebra of $\pi$ is accordingly a function
$x(s_1, s_2)$; it follows from this that the algebra of $\pi$ is the
product of the algebras $(\pi_1)$ and $(\pi_2)$:

$$(\pi) = (\pi_1) \times (\pi_2)$$

in the sense of the $\times$-multiplication of vector spaces introduced
in II, § 10. An element $x_1$: $x_1(s_1)$ of $(\pi_1)$ and an element $x_2$:
$x_2(s_2)$ of $(\pi_2)$ yield the element $x = x_1 \times x_2$ of $(\pi)$, whose
components are given by

$$x(s_1, s_2) = x_1(s_1) \cdot x_2(s_2).$$

Indeed, given any two algebras $\rho_1$, $\rho_2$, their direct product
$\rho = \rho_1 \times \rho_2$ can be constructed and multiplication in $\rho$ defined by

$$(a_1 \times a_2)(b_1 \times b_2) = (a_1 b_1 \times a_2 b_2)$$

whether they are group algebras or not.

If $\mathfrak{p}_\alpha$ is a linear sub-space of $\mathfrak{r}_\alpha = \rho_\alpha$ ($\alpha = 1, 2$), an element
$x$: $x(s_1, s_2)$ of $(\pi)$ is in $\mathfrak{p} = \mathfrak{p}_1 \times \mathfrak{p}_2$ if and only if it belongs to
$\mathfrak{p}_1$ when considered as a function of $s_1$, holding $s_2$ fixed, and to
$\mathfrak{p}_2$ when $s_1$ is held fixed; indeed, any element of this kind can
be expressed as a linear combination of products of the form
$a_1 \times a_2$, where $a_1$ is in $\mathfrak{p}_1$ and $a_2$ in $\mathfrak{p}_2$. If $\mathfrak{p}_\alpha$ ($\alpha = 1, 2$) is an
invariant sub-space of $\mathfrak{r}_\alpha$ generated by the idempotent element
$e_\alpha$ and the representation space of the representation $\mathfrak{h}_\alpha$ of $\rho_\alpha$
induced in $\mathfrak{p}_\alpha$ by the regular representation, then $\mathfrak{p}$ is also
invariant, has as generating idempotent element $e = e_1 \times e_2$
and is the substratum of the representation $\mathfrak{h}_1 \times \mathfrak{h}_2$ of $\rho$. It is
evident that the equivalences $\mathfrak{p}_1 \sim \mathfrak{p}_1'$, $\mathfrak{p}_2 \sim \mathfrak{p}_2'$ imply the
equivalence $\mathfrak{p}_1 \times \mathfrak{p}_2 \sim \mathfrak{p}_1' \times \mathfrak{p}_2'$.

Suppose the two $\mathfrak{p}_\alpha$ considered above are also irreducible
with respect to their algebras $\rho_\alpha$; the question then arises as
to whether $\mathfrak{p}_1 \times \mathfrak{p}_2$ is irreducible (with respect to $\rho$) and whether
$\mathfrak{p} = \mathfrak{p}_1 \times \mathfrak{p}_2$ is equivalent to $\mathfrak{p}' = \mathfrak{p}_1' \times \mathfrak{p}_2'$ ($\mathfrak{p}_\alpha'$ irreducible) only
if $\mathfrak{p}_1 \sim \mathfrak{p}_1'$, $\mathfrak{p}_2 \sim \mathfrak{p}_2'$. $\mathfrak{p}$ and $\mathfrak{p}'$ are inequivalent if $exe' = 0$
identically in $x$, i.e. if the sub-space consisting of elements of
character $(e, e')$ contains only the element 0; here $e = e_1 \times e_2$,
$e' = e_1' \times e_2'$. Now the formula

$$(e_1 \times e_2)(x_1 \times x_2)(e_1' \times e_2') = e_1 x_1 e_1' \times e_2 x_2 e_2'$$

shows immediately that the sub-space $(e, e')$ is the direct product
of the two sub-spaces $(e_1, e_1')$ and $(e_2, e_2')$, and can consist merely
of 0 only if one of these two sub-spaces consists merely of 0,
i.e. only if $\mathfrak{p}_1$ is inequivalent to $\mathfrak{p}_1'$ or $\mathfrak{p}_2$ is inequivalent to $\mathfrak{p}_2'$.
Our second question is thus answered in the affirmative, regardless
of the nature of the field over which the algebras are defined.

The first question is answered in the affirmative in III, § 9,
for the only case of physical interest, i.e. that in which the field
is algebraically closed. If we are more interested in the
reduction of the algebra than in the representations we can argue
as follows. The algebra of elements of character $(e, e)$ is the
direct product of the field (division algebra) $\Phi_1$ of elements of
character $(e_1, e_1)$ in $\rho_1$ and the field $\Phi_2$ of character $(e_2, e_2)$ in $\rho_2$.
Assuming the original field is algebraically closed, all elements
of $\Phi_\alpha$ are multiples of $e_\alpha$ and consequently all elements of $\rho$
with character $(e, e)$ are multiples of $e$. This proves the
irreducibility of $\mathfrak{p}_1 \times \mathfrak{p}_2$. If, however, the original field over which
the algebras are defined is not algebraically closed our assertion
is correct only if the direct product $\Phi_1 \times \Phi_2$ of the two fields
is again a field, and this is by no means always the case. But
in any case the question concerning the nature of the direct
product of algebras is, as in the question concerning the structure
of an algebra in § 7, reduced to the analogous problem for fields
(division algebras).

Again taking the fundamental field to be the continuum of
all complex numbers, the complete reduction

$$\mathfrak{r}_\alpha = \sum_k \mathfrak{p}_k^{(\alpha)} \qquad (\alpha = 1, 2)$$

into irreducible invariant sub-spaces $\mathfrak{p}_k^{(\alpha)}$ has as a consequence,
in accordance with the above, the reduction of $\mathfrak{r} = \mathfrak{r}_1 \times \mathfrak{r}_2$ into
invariant irreducible sub-spaces $\mathfrak{p}_k^{(1)} \times \mathfrak{p}_l^{(2)}$.
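In terms of characters, the statement above says that over the complex numbers the products $\chi_1(s_1)\chi_2(s_2)$ of primitive characters of the factors are primitive, mutually inequivalent, and exhaust the primitive characters of the direct product. A minimal numerical sketch follows, assuming the standard character tables of the symmetric groups on 2 and on 3 things; the dictionary layout and function names are illustrative only. Since these characters are real and every class coincides with its inverse, the inner product needs no conjugation.

```python
from fractions import Fraction
from itertools import product

# Class sizes and primitive characters (identity class listed first); standard tables.
S2 = {"sizes": (1, 1),
      "chars": {"sym": (1, 1), "anti": (1, -1)}}
S3 = {"sizes": (1, 3, 2),
      "chars": {"sym": (1, 1, 1), "mixed": (2, 0, -1), "anti": (1, -1, 1)}}


def direct_product(g1, g2):
    """Classes of the product are pairs of classes; characters multiply."""
    sizes = tuple(n1 * n2 for n1, n2 in product(g1["sizes"], g2["sizes"]))
    chars = {
        (name1, name2): tuple(x * y for x, y in product(chi1, chi2))
        for name1, chi1 in g1["chars"].items()
        for name2, chi2 in g2["chars"].items()
    }
    return {"sizes": sizes, "chars": chars}


def inner(group, chi, psi):
    """(1/h) * sum over classes of n(k) chi(k) psi(k), for real characters."""
    order = sum(group["sizes"])
    total = sum(n * x * y for n, x, y in zip(group["sizes"], chi, psi))
    return Fraction(total, order)
```

For the product of these two groups the six product characters are orthonormal, their number equals the number of classes, and the squares of their dimensions add up to the order 12.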

Sub-groups.

Let $\pi'$ be a sub-group of the given finite group $\pi$. An element
$x'$ of the algebra $\mathfrak{r}' = \rho' = (\pi')$ of $\pi'$ consists of components $x'(s')$
associated with the various elements $s'$ of $\pi'$. However, such
an element can, and in the following will, at the same time be
considered as an element of the algebra $\rho = (\pi)$; we need only
to define the components $x'(s)$ associated with elements $s$ of $\pi$
which are not contained in $\pi'$ as zero. This disturbs in no way
the addition and multiplication of elements of $(\pi')$ with each
other or with arbitrary numbers of the field. An element $x$ of
$(\pi)$ "belongs" to $\pi'$ or "lies" in $(\pi')$ if and only if all
components $x(s)$ associated with elements $s$ of the group that are
not in $\pi'$ vanish.

An irreducible invariant sub-space $\mathfrak{p}'$ of $\mathfrak{r}'$ is generated by a
primitive idempotent element $e'$ and is the substratum of a
representation $\mathfrak{h}'$ of $\pi'$ induced in $\mathfrak{p}'$ by the regular representation.
On reducing the modulus 1 of $\pi'$ into independent primitive
idempotent elements

$$1 = \sum_i e_i' + \cdots \tag{10.1}$$

a certain number, say $g'$, of elements $e_i'$ will appear which are
equivalent to $e'$; the sub-spaces $\mathfrak{p}_i'$ which they generate are all
equivalent to $\mathfrak{p}'$ and the regular representation of $\pi'$ contains $\mathfrak{h}'$
$g'$ times. Equivalent summands are added together into
such partial sums. Considered as an element of the total
algebra $\rho = (\pi)$, $e'$ is, however, in general reducible into
independent primitive idempotent elements:

$$e' = \sum_\alpha e_\alpha + \cdots. \tag{10.2}$$

Here again equivalent summands on the right are collected
together into partial sums; let the $e_\alpha$ in the first such partial
sum generate the representation $\mathfrak{h}$ of $\pi$; we shall in the following
be interested only in these. Let the sub-space $\mathfrak{p}$ with the
generating unit $e$ be a representative of the sub-spaces $\mathfrak{p}_\alpha$
generated by the $e_\alpha$. The elements of $(\pi)$ of the form $xe'$ constitute
an invariant sub-space $\langle\mathfrak{p}'\rangle$ which is the substratum of a
representation $\langle\mathfrak{h}'\rangle$ of $\pi$ induced in $\langle\mathfrak{p}'\rangle$ by the regular representation
of $\pi$. Our formula asserts that on reducing $\langle\mathfrak{h}'\rangle$ into its
irreducible constituents $\mathfrak{h}$ occurs exactly $b$ times.

In order to obtain a simple characterization of the elements
of $\langle\mathfrak{p}'\rangle$ we divide the elements of the group $\pi$ into sets of group
elements which are equivalent mod $\pi'$; the $u$-th such class
consists of the group elements $a_u s'$ where $s'$ runs through the
sub-group $\pi'$. An element $x$ of the algebra $(\pi)$ has as components
$x(a_u s')$; the numbers $x(a_u s')$ may, for fixed $u$, be considered as
the components of an element $x_u'$ of the algebra $(\pi')$, so that $x$
may be considered as the set of elements $x_u'$ belonging to the
algebra $(\pi')$. The formula $y = xe'$ then becomes $y_u' = x_u' e'$ in
$(\pi')$: hence $x$ belongs to $\langle\mathfrak{p}'\rangle$ if and only if all the partial
elements $x_u'$ lie in $\mathfrak{p}'$. The correspondence

$$x \to y = ax: \qquad y(a_u s') = \sum_v \sum_{t' \text{ in } \pi'} a(a_u s' t'^{-1} a_v^{-1})\, x(a_v t')$$

may then be written

$$y_u' = \sum_v a_{uv}'\, x_v',$$

where $a_{uv}'$ is the element of the algebra $(\pi')$ defined by

$$a_{uv}'(s') = a(a_u s' a_v^{-1}).$$

The representation $\langle\mathfrak{h}'\rangle$ may therefore be constructed as follows:
first associate with the element $a$ of $(\pi)$ the matrix $\|a_{uv}'\|$, the
coefficients of which are elements of the algebra $(\pi')$ instead of
numbers, and then replace each $a_{uv}'$ by the matrix associated
with it in the representation $\mathfrak{h}'$ of $\pi'$.

As we have seen in the earlier part of the present chapter,
the representations are obtained with the aid of a double Peirce
decomposition; we therefore consider the elements $x = e'xe'$ of
character $(e', e')$. The idempotent elements $e_\alpha, \cdots$ appearing
in (10.2) are of this character, and such an element $x$ may be
expressed in terms of its components

$$x = \sum_{\alpha,\beta = 1}^{b} e_\alpha x e_\beta + \cdots. \tag{10.3}$$

We now repeat the analysis of § 7 for our more restricted set
of elements: let $\Gamma_\alpha$ be a one-to-one similarity correspondence
of $\mathfrak{p}_\alpha$ on $\mathfrak{p}$ and let the element into which $e_\alpha$ is sent by the
correspondence $\Gamma_\alpha \Gamma_\beta^{-1}$ be denoted by $e_{\alpha\beta}$.* If, as we now assume,
the field over which the algebras are defined is algebraically
closed, $e_\alpha x e_\beta$ is necessarily a multiple $x_{\alpha\beta}$ of $e_{\alpha\beta}$. We then obtain
instead of (10.3) the reduction

$$x = \sum x_{\alpha\beta}\, e_{\alpha\beta} + \cdots, \tag{10.4}$$

(where the $x_{\alpha\beta}$ are numbers) and the representations

$$x \to \|x_{\alpha\beta}\|, \quad \cdots. \tag{10.4'}$$

* Here, as in § 7, but in contrast with our usual notation, the product of
two or more correspondences $\Gamma$ is to be read from left to right.

Now if in particular $x$ is in $(\pi')$ then $x = e'xe'$ is a numerical
multiple of (10.2), and the matrix $\|x_{\alpha\beta}\|$ associated with such an
element is a multiple of the unit matrix. The degree of the
secular equation, the solutions of which determine the
characteristic numbers, is thus decreased from $g$ to $b$ for an element $x$
of character $(e', e')$. We now proceed to examine the cause of
this.

Let $\Gamma_i'$ be a one-to-one similar correspondence of $\mathfrak{p}'$ on $\mathfrak{p}_i'$
($i = 1, 2, \cdots, g'$), and let the element into which it sends $e'$
be $b_i'$. On considering an arbitrary element $x$ of the algebra
of $\pi$ as the set $x_u'$, we see that the correspondence

$$xe' \to xb_i'$$

is a one-to-one reciprocal and similar mapping of $\langle\mathfrak{p}'\rangle$ on $\langle\mathfrak{p}_i'\rangle$:
the projection $\Gamma_i'$ of $\mathfrak{p}_i'$ on $\mathfrak{p}'$ gives rise to such a projection of
$\langle\mathfrak{p}_i'\rangle$ on $\langle\mathfrak{p}'\rangle$. This projection associates with the reduction
of $\langle\mathfrak{p}'\rangle$ into irreducible invariant sub-spaces a reduction of the
same kind of the sub-space $\langle\mathfrak{p}_i'\rangle$; corresponding to equation
(10.2) we obtain the equations

$$e_i' = \sum_\alpha e_{\alpha i} + \cdots. \tag{10.5}$$

On combining (10.1) and (10.5) we obtain a reduction of the
modulus 1 into independent primitive idempotent elements of
$(\pi)$. Now consider the partial sums $\sum_i e_i', \cdots$ with their reductions
(10.5) as written one above the other. Each row is then associated
with a definite representation of $\pi'$ and each column
on the right-hand side, the terms of which are sums of the form
$\sum_i \sum_\alpha$, is associated with a definite representation $\mathfrak{h}$ of $\pi$. We
now collect together all the summands $e_J$ occurring in the first
column on the right, i.e. all those elements $e_J$ which are equivalent
to $e$. The set of indices $J$ is then broken up into sub-sets, each
of which is associated with one of the inequivalent irreducible
representations $\mathfrak{h}', \cdots$ of $\pi'$; the first of these sub-sets, which is
associated with $\mathfrak{h}'$, consists of the $bg'$ double indices $\alpha i$.

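Let the similarity projection $\Gamma_i'^{-1}\Gamma_k'$ of $\mathfrak{p}_i'$ on $\mathfrak{p}_k'$ send $e_i'$
into $e_{ik}'$. If $x'$ is an element of $(\pi')$ the equation

$$x' = \sum_{i,k} e_i' x' e_k' + \cdots$$

yields the reduction

$$x' = \sum_{i,k} x_{ik}'\, e_{ik}' + \cdots \tag{10.6}$$

with numerical coefficients $x_{ik}'$, and $x' \to \|x_{ik}'\|$ is the representation
$\mathfrak{h}'$. (The partial sums should preferably be written one
above the other rather than horizontally.) $\Gamma_i'$ may be considered
as a similarity transformation of $\langle\mathfrak{p}'\rangle$ on $\langle\mathfrak{p}_i'\rangle$ and
therefore contains a transformation of the same type of $\mathfrak{p}_\alpha$ on
$\mathfrak{p}_{\alpha i}$; $\Gamma_i'^{-1}\Gamma_\alpha$ then provides us with a similarity correspondence
of $\mathfrak{p}_{\alpha i}$ on $\mathfrak{p}$. Let $\Gamma_J$ be a fixed one-to-one similarity correspondence
of $\mathfrak{p}_J$ on $\mathfrak{p}$ and let the similarity correspondence $\Gamma_J\Gamma_K^{-1}$ of
$\mathfrak{p}_J$ on $\mathfrak{p}_K$ send $e_J$ into $e_{JK}$. We may take the correspondence
$\Gamma_i'^{-1}\Gamma_\alpha$ as $\Gamma_J$ for the index $J = \alpha i$, and similarly for the remaining
sub-sets. On applying the correspondence $\Gamma_i'^{-1}\Gamma_\alpha(\Gamma_k'^{-1}\Gamma_\alpha)^{-1}$
to equation (10.5) we find

$$e_{ik}' = \sum_{\alpha=1}^{b} e_{\alpha i;\,\alpha k} + \cdots. \tag{10.7}$$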

The equation

$$x = \sum_{J,K} e_J x e_K + \cdots = \sum_{J,K} x_{JK}\, e_{JK} + \cdots \tag{10.8}$$

then determines the representations

$$\mathfrak{h} : x \to \|x_{JK}\|; \ \cdots.$$

By (10.6) and (10.7) the matrix associated with an element $x'$
of $(\pi')$ is

$$x_{\alpha i;\,\beta k}' = \delta_{\alpha\beta}\, x_{ik}', \qquad x_{JK}' = 0,$$

where in the second equation the two indices $J$ and $K$ belong to
different sub-sets. But this means that on restricting $\pi$ to $\pi'$ the
representation $\mathfrak{h}$ is reducible into the irreducible representations
$\mathfrak{h}', \cdots$ of $\pi'$, $\mathfrak{h}'$ appearing exactly $b$ times. We have thus obtained
a constructive proof of the theorem:

First Reciprocity Theorem (for arbitrary groups). If $\langle\mathfrak{h}'\rangle$
contains the representation $\mathfrak{h}$ of $\pi$ exactly $b$ times, then on restricting
the group $\pi$ to $\pi'$, $\mathfrak{h}$ contains the representation $\mathfrak{h}'$ of $\pi'$ exactly
$b$ times.

If the sub-group $\pi'$ consists merely of the unit element 1
this theorem reduces to our previous result: the number of
times an irreducible representation appears in the regular
representation is equal to its dimensionality. Both the complete
theorem and this special case depend on the assumption
that the field over which the algebra is defined is algebraically
closed.
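The theorem can be checked numerically on the smallest non-trivial example. The following is only a sketch under stated assumptions: the group is taken to be the symmetric group on three objects, the sub-group is generated by one transposition, the characters are hard-coded rather than derived, and the representation of the full group carried by the elements $xe'$ is identified with what is usually called the induced representation. All function and variable names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations


def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))


def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


def sign(p):
    """Signature of a permutation, from its inversion count."""
    n = len(p)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n) if p[i] > p[j])
    return -1 if inversions % 2 else 1


G = list(permutations(range(3)))   # assumed example: symmetric group on 3 objects
H = [(0, 1, 2), (1, 0, 2)]         # assumed sub-group: identity and one transposition

# Irreducible characters (hard-coded; all of them are real for these groups).
chars_G = {
    "trivial": lambda p: 1,
    "sign": sign,
    "standard": lambda p: sum(1 for i in range(3) if p[i] == i) - 1,
}
chars_H = {"trivial": lambda p: 1, "sign": sign}


def mean(values):
    values = list(values)
    return sum(values, Fraction(0)) / len(values)


def restricted_multiplicity(chi, chi_sub):
    """Number of times chi_sub occurs in chi restricted to H."""
    return mean(chi(h) * chi_sub(h) for h in H)


def induced_character(chi_sub, g):
    """Character at g of the representation of G induced from chi_sub on H."""
    total = 0
    for x in G:
        conj = compose(compose(inverse(x), g), x)   # x^-1 g x
        if conj in H:
            total += chi_sub(conj)
    return Fraction(total, len(H))


def induced_multiplicity(chi, chi_sub):
    """Number of times chi occurs in the representation induced from chi_sub."""
    return mean(chi(g) * induced_character(chi_sub, g) for g in G)


# For every pair: (multiplicity in the induced representation, multiplicity on restriction).
table = {
    (name, sub): (induced_multiplicity(chi, chi_sub), restricted_multiplicity(chi, chi_sub))
    for name, chi in chars_G.items()
    for sub, chi_sub in chars_H.items()
}
```

For each pair the two entries of `table` agree, which is the content of the theorem in this example; the two-dimensional ("standard") representation, for instance, contains each of the two representations of the sub-group exactly once.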

Connection with Symmetry Classes of Tensors. 

We apply the results of our investigation III to the symmetric
group $\pi$ and make use of the correlation described in I above for
$\pi$ as well as for its sub-group $\pi'$. An irreducible sub-space $\mathfrak{p}$
of $(\pi)$ determines a symmetry class $\mathfrak{P}$ of tensors; let the
corresponding representations of $\pi$ and the linear group $\mathfrak{c}$ be
$\mathfrak{h}$ and $\mathfrak{H}$ respectively. An irreducible invariant sub-space $\mathfrak{p}'$ of
$(\pi')$ determines a symmetry class $\mathfrak{P}'$ of tensors which is invariant
with respect to the more extensive algebra $\Sigma'$ of all transformations
which are symmetric with respect to $\pi'$; as such $\mathfrak{P}'$ is
irreducible. If $e'$ is the generating unit of $\mathfrak{p}'$, $\mathfrak{P}'$ consists of all
tensors of the form $e'F$; but this is equivalent to saying that
the symmetry element $\mathbf{F}$ of $(\pi)$ belongs to $\langle\mathfrak{p}'\rangle$. Hence the
reduction of $\mathfrak{P}'$ into irreducible invariant sub-spaces with respect
to the more restricted algebra $\Sigma$ parallels the reduction of $\langle\mathfrak{p}'\rangle$.
Let $\mathfrak{h}'$ be that representation of $\pi'$ induced in $\mathfrak{p}'$ by the regular
representation of $\pi'$ and $\mathfrak{H}'$ that representation of $\mathfrak{c}$ whose
substratum consists of all tensors in the symmetry class $\mathfrak{P}'$. Hence
our general theorem, or rather its converse, the truth of which
follows immediately from the theorem itself, allows us to state
the

Second Reciprocity Theorem (applicable only to permutation
groups). If the irreducible representation $\mathfrak{h}$ of $\pi$ contains the
irreducible representation $\mathfrak{h}'$ of $\pi'$ exactly $b$ times when considered
as a representation of the sub-group $\pi'$, then conversely the
representation $\mathfrak{H}'$ of $\mathfrak{c}$ contains the representation $\mathfrak{H}$ exactly $b$ times.

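Finally we take $\pi'$ as $\pi_1 \times \pi_2$ as in step II above. $\mathfrak{p}'$ can
then always be taken in the form $\mathfrak{p}_1 \times \mathfrak{p}_2$, and the irreducible
invariant sub-space $\mathfrak{p}_\alpha$ of $(\pi_\alpha)$ determines a symmetry class $\mathfrak{P}_\alpha$
of tensors of order $f_\alpha$ ($\alpha = 1, 2$). Denote the corresponding
representations of $\pi_\alpha$ and $\mathfrak{c}$ by $\mathfrak{h}_\alpha$ and $\mathfrak{H}_\alpha$. The $\mathfrak{P}'$ associated
with $\mathfrak{p}' = \mathfrak{p}_1 \times \mathfrak{p}_2$ consists of all tensors of order $f = f_1 + f_2$
which satisfy the symmetry conditions of $\mathfrak{P}_1$ with respect to
their first $f_1$ indices and the symmetry conditions of $\mathfrak{P}_2$ with
respect to the last $f_2$; i.e. $\mathfrak{P}' = \mathfrak{P}_1 \times \mathfrak{P}_2$. Our theorem now
becomes:

Third Reciprocity Theorem (for permutation groups). If the
irreducible representation $\mathfrak{h}$ of $\pi$ contains, on restricting $\pi$ to the
sub-group $\pi' = \pi_1 \times \pi_2$, the representation $\mathfrak{h}_1 \times \mathfrak{h}_2$ of $\pi'$ exactly
$b$ times ($\mathfrak{h}_\alpha$ an irreducible representation of $\pi_\alpha$), then conversely the
representation $\mathfrak{H}_1 \times \mathfrak{H}_2$ of $\mathfrak{c}$ contains the representation $\mathfrak{H}$ exactly $b$
times.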
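A dimension count gives a quick plausibility check of this theorem for tensors of order 3 split as 2 + 1. The sketch below rests on assumptions not made in this section: symmetry classes are labelled by Young diagrams (tuples of row lengths), dimensions of the representations of the linear group in $n$ dimensions are computed with the hook-content formula, and the multiplicities $b$ for the restriction from three objects to 2 + 1 objects are entered by hand. The names `gl_dimension`, `branching` and `dimensions_agree` are hypothetical.

```python
from fractions import Fraction


def conjugate(partition):
    """Column lengths of the Young diagram with the given row lengths."""
    return tuple(sum(1 for row in partition if row > j) for j in range(partition[0]))


def gl_dimension(partition, n):
    """Dimension of the representation of the linear group in n dimensions
    labelled by `partition`, via the hook-content formula. The product
    vanishes automatically when the diagram has more than n rows."""
    cols = conjugate(partition)
    dim = Fraction(1)
    for i, row in enumerate(partition):
        for j in range(row):
            hook = (row - j - 1) + (cols[j] - i - 1) + 1   # arm + leg + 1
            dim *= Fraction(n + j - i, hook)
    return int(dim)


# Assumed multiplicities b: how often the pair (diagram for the first two
# indices, diagram for the last index) occurs in each diagram with three boxes.
branching = {
    ((2,), (1,)): {(3,): 1, (2, 1): 1},
    ((1, 1), (1,)): {(2, 1): 1, (1, 1, 1): 1},
}


def dimensions_agree(n):
    """Compare dim(H_1 x H_2) with the sum of b * dim(H) over all H."""
    for (first, second), parts in branching.items():
        lhs = gl_dimension(first, n) * gl_dimension(second, n)
        rhs = sum(b * gl_dimension(p, n) for p, b in parts.items())
        if lhs != rhs:
            return False
    return True
```

Agreement of the dimensions for several $n$ does not prove the theorem, but a wrong multiplicity in `branching` would be detected at once.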

§ 11. Perturbation Theory for the Construction of Molecules

We return to the investigation of the physical system $I^f$
consisting of $f$ electrons or equivalent individuals $I$. As long
as we disregard the interaction between the individuals we obtain,
among others, $f!$-fold energy levels $E$ of the type (8.4). We
consider in particular the case in which the $E_i$ are different
simple levels of the individual $I$. In order to follow the resolution
of $E$, due to the mutual interactions of the electrons, to
the approximation which characterizes the perturbation theory,
we must first determine the elements $a$ of the algebra of $\pi$, the
components $a(s)$ of which are the exchange energies, and transform
the matrices corresponding to $a$ in the various irreducible
representations of $\pi$ into diagonal form by an appropriate
change of co-ordinates (§ 8). We now assume that the most
important of the exchange energies $a(s)$ are those belonging to
the permutations $s$ of a certain sub-group $\pi'$ of $\pi$; all others
shall be small in comparison with them ("quantities of 2nd
order"). Our procedure is divided into two steps, corresponding
to the investigation of sub-groups carried out in the preceding
section. Let $a'$ denote that element of the algebra $(\pi')$ which is
defined by

$$a'(s) = a(s) \ \text{or} \ 0$$

according as $s$ is an element of the sub-group $\pi'$ or not, and let
the matrices associated with $a'$ in the irreducible representations
of $\pi'$ be referred to principal axes; then

$$e_i' a' e_k' = 0 \quad (i \neq k), \qquad e_i' a' e_i' = W_i\, e_i'.$$

The characteristic numbers $W_i$ are the energy levels on neglecting
perturbations of 2nd order; we assume they are all different.
In order to examine the further resolution of such a term
$W = W_i$ under the influence of the 2nd order perturbation we
need, in accordance with the perturbation theory, to consider
only that part

$$a^* = e'ae'$$

of $a$ which is of character $(e', e')$, where we have written $e'$ in
place of $e_i'$. This term yields $b$ terms $W_\alpha$ belonging to the
symmetry class $\chi$ associated with the irreducible representation
$\mathfrak{h}$ of $\pi$, the values of which are the characteristic numbers of
the matrix $\|a^*_{\alpha\beta}\|$ associated with the element $a^* = e'ae'$ as in
(10.4'). All the algebraic elements appearing in these considerations
are real and the corresponding matrices are consequently
Hermitian.

We apply the procedure to the process by which molecules
are constructed from their constituent atoms. We consider
as an example two atoms joining to form a molecule, the one
containing $f_1$ and the other $f_2$ electrons; $f = f_1 + f_2$. We
consider the two nuclei as held fixed at a distance $d$ apart, which
is large compared with the linear dimensions of the atoms, and
attempt to determine their interaction energy as a function of $d$.
The sub-group $\pi' = \pi_1 \times \pi_2$ consists of all permutations which
send no electron of one atom over into the other; we have seen
in § 10 that we may then take the primitive idempotent elements
$e_i' = e'$ of the algebra $(\pi')$ in the form $e_1 \times e_2$, where $e_1$, $e_2$ are
in $(\pi_1)$, $(\pi_2)$ respectively. On neglecting the interaction between
the electrons of the one and the electrons of the other atom we
obtain an energy term $W$ which belongs to definite symmetry
states of both atoms. $e'$ generates a sub-space $\mathfrak{P}' = \mathfrak{P}_1 \times \mathfrak{P}_2$
(of the tensor space $\mathfrak{R}^f$) which is invariant under all symmetric
transformations; that the state of the molecule is described
by a tensor of this sub-space $\mathfrak{P}'$ means that the state of the first
atom is in $\mathfrak{P}_1$ and that of the second in $\mathfrak{P}_2$. Hence on reducing
$\mathfrak{P}'$ in parallel with the reduction of $\langle\mathfrak{p}'\rangle$ into irreducible
invariant sub-spaces:

$$\mathfrak{P}' = \sum_\alpha \mathfrak{P}_\alpha + \cdots, \qquad \langle\mathfrak{p}'\rangle = \sum_\alpha \mathfrak{p}_\alpha + \cdots,$$

there occur $b$ sub-spaces $\mathfrak{P}_\alpha$ which are equivalent to one another
and which belong to a certain representation of $\pi$ or to a certain
symmetry class of terms of the total system. The procedure
sketched in the preceding paragraph thus leads to $b$ terms which
(1) arise, due to the perturbation, from the given unperturbed
term (8.4) and (2) belong to certain given symmetry
states $\chi_1$, $\chi_2$, $\chi$ of the two atoms and the molecule. This
reduction of the total system space $\mathfrak{R}^f$ into sub-spaces, each of
which corresponds to a definite symmetry state of each of the
atoms taken separately and of the molecule, naturally is not
bound up with the approximate calculation of levels with the
aid of perturbation theory; the connection between the two
appears only on taking the above condition (1) into account,
the very essence of which implies the assumption of small
perturbations. This somewhat sketchy account of the situation
arising from an unperturbed term of the type (8.4), in which
the energies of the individual $I$ are non-degenerate, can readily
be extended to cover other more complicated types of unperturbed
terms. These other cases are of course of much greater
physical interest, for we have seen in Chapter IV that all atomic
energy levels, except $S$-terms, are necessarily degenerate.
The fact that the total system may be in any one of several
symmetry states corresponding to different energy levels
(i.e. binding energies), when the symmetry states of the component
atoms are given, is of greatest importance. We shall
later show that these possibilities, finite in number, coincide with
those predicted by the empirical theory of the valence bond, and
that consequently the symmetry state of an atom is that which
chemists call its valence state. The situation thus arising cannot
be described adequately in terms of classical models; consider e.g. the
fact that the two H atoms constituting an H$_2$ molecule can behave
in such a way that the state of the molecule may lie in
either the space of symmetric or anti-symmetric tensors of
order 2; only the first case can lead to an attraction which will
bind the atoms together, whereas the second always results in a
repulsion. The binding energy between two ions of total residual
charges $e_1$, $e_2$ is naturally due mainly to the Coulomb potential
$e_1 e_2/d$ ("ionic binding" or "polar bond"), but the corresponding
energy for two neutral atoms is due for the most part to the
interaction of the "exchange energies" $a(s)$ of the electrons of
the two atoms ("atomic binding" or "non-polar bond").
This quantum-mechanical solution of the puzzle offered by the
non-polar valence bond was first given by F. London and
W. Heitler.

The following points are to be taken into consideration in
applying the theory of perturbations to the actual evaluations.
On neglecting the interaction between the various electrons
each is subject only to the attraction of the two nuclei; we
should therefore perhaps begin with the characteristic numbers
$E_i$ and the corresponding characteristic functions $\psi_i(xyz)$ of
this one-electron problem. The first approximation should then
be obtained by taking into account the repulsions between the
electrons of each of the atoms separately, thus introducing a
dynamical difference between the two kinds of electrons. This
procedure is naturally significant only so long as the distance $d$
between the atoms is large in comparison with their linear
dimensions $a$. But then it is also reasonable to take as our
0th approximation that in which each of the electrons is subject
only to the attraction of its own nucleus (plus the closed shell
of electrons which are not to be taken into explicit account in
the calculations). Let this one-electron problem for the first
atom have the characteristic values $E_i$ and characteristic functions
$\psi_i$, and let the corresponding quantities for the second
atom be $E_{i'}$, $\psi_{i'}$. The fact that the $\psi_i$ and the $\psi_{i'}$ together
cannot constitute an orthogonal system (indeed, they are not
even linearly independent, for the $\psi_i$ alone constitute a complete
orthogonal system) causes some difficulty. But if we break off
the series of quantum states at a finite $n$, which can be chosen
higher the larger the value of $d/a$ under consideration, the
finite set

$$\psi : \ \psi_1, \psi_2, \cdots, \psi_n; \ \psi_{1'}, \psi_{2'}, \cdots, \psi_{n'}$$

of functions $\psi$ constitutes an almost orthogonal system; the
fundamental metric form $G_0$, the coefficients of which are the
scalar products

$$g_{ik} = (\psi_i, \psi_k) = \int \bar{\psi}_i \psi_k \, dV$$

(where $i$ and $k$ run through the primed as well as the un-primed
indices), differs but little from the unit form. Indeed, an integral
of the form $(\psi_i, \psi_{i'})$ is of order of magnitude $e^{-d/a}$. To show
this we note that if the two centres of force are nuclei or closed
cores with "unit" residual charge, the normal states of the
atoms are given by

$$\psi_1 = \frac{1}{\sqrt{\pi a^3}}\, e^{-r/a}, \qquad \psi_{1'} = \frac{1}{\sqrt{\pi a^3}}\, e^{-r'/a},$$

where $r$ and $r'$ are the distances to the two cores. The integrand of

$$(\psi_1, \psi_{1'}) = \frac{1}{\pi a^3} \int e^{-(r+r')/a}\, dV$$

is everywhere $\leq \frac{1}{\pi a^3}\, e^{-d/a}$, since $r + r' \geq d$. This integral can readily be exactly
evaluated on introducing bi-polar co-ordinates $(r, r', \phi)$; the
volume element is then

$$dV = \frac{r r'}{d}\, dr\, dr'\, d\phi$$

and the range of integration is defined by

$$r + r' \geq d, \qquad -d \leq r - r' \leq d.$$

On introducing

$$\frac{r + r'}{d} = \rho, \qquad \frac{r - r'}{d} = \sigma$$

we obtain

$$(\psi_1, \psi_{1'}) = \frac{1}{4}\left(\frac{d}{a}\right)^{3} \int_{1}^{\infty}\!\!\int_{-1}^{+1} (\rho^2 - \sigma^2)\, e^{-(d/a)\rho}\, d\rho\, d\sigma.$$
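The overlap integral of the two normal states, written in the dimensionless variables $(r + r')/d$ and $(r - r')/d$ and with the abbreviation $\alpha = d/a$, can be checked numerically. The sketch below is an illustration only: the symbol $\alpha$, the Simpson quadrature, the truncation of the infinite range and all function names are assumptions introduced here. It compares the double integral $\tfrac{1}{4}\alpha^3 \iint (\rho^2 - \sigma^2)\, e^{-\alpha\rho}\, d\rho\, d\sigma$ with the closed form $(1 + \alpha + \tfrac{1}{3}\alpha^2)\, e^{-\alpha}$ obtained by elementary integration.

```python
import math


def simpson(f, a, b, n=2000):
    """Composite Simpson rule on [a, b] with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3


def overlap_numeric(alpha):
    """(alpha^3 / 4) times the double integral of (rho^2 - sigma^2) exp(-alpha rho),
    rho from 1 to infinity (truncated), sigma from -1 to +1; alpha = d/a."""
    def inner(rho):
        # Simpson is exact for the quadratic dependence on sigma.
        return simpson(lambda sigma: rho * rho - sigma * sigma, -1.0, 1.0, n=20)

    upper = 1.0 + 60.0 / alpha   # the neglected tail is of relative order exp(-60)
    outer = simpson(lambda rho: inner(rho) * math.exp(-alpha * rho), 1.0, upper)
    return alpha ** 3 / 4 * outer


def overlap_closed(alpha):
    """Closed form (1 + alpha + alpha^2 / 3) exp(-alpha) of the same integral."""
    return (1 + alpha + alpha * alpha / 3) * math.exp(-alpha)
```

Both functions agree to within the quadrature error, and the closed form makes the exponential decrease with $d/a$ explicit.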


For the $f$-electron problem we therefore start with the
functions

$$\psi(i_1, \cdots, i_f) = \prod_i \psi_i(xyz)$$

as approximations to the characteristic functions; in this
product the co-ordinates are those of the $f$ electrons and $i$ runs
through the values $i_1, i_2, \cdots, i_f$, each of which is one of the primed
or un-primed indices between $1'$ and $n'$ or 1 and $n$. The fundamental
metric form $G = G_0 \times G_0 \times \cdots \times G_0$ has as components
the scalar products of $\psi(i_1 i_2 \cdots i_f)$ with $\psi(k_1 k_2 \cdots k_f)$;
the components of the energy $H$, the potential part of which is
obtained by adding together the potential energies resulting
from the attractions and repulsions of the various electrons and
the two cores, are the scalar products of $\psi(i_1 \cdots i_f)$ with the
vector $\psi'(k_1 \cdots k_f)$ into which $\psi(k_1 \cdots k_f)$ is sent by the
operator $H$. We consider the resolution of the unperturbed
term

$$E = (E_1 + \cdots + E_{f_1}) + (E_{1'} + \cdots + E_{f_2'}).$$

The components

$$G(i_1 \cdots i_f;\, k_1 \cdots k_f) \quad \text{and} \quad H(i_1 \cdots i_f;\, k_1 \cdots k_f), \tag{11.1}$$

in which the indices $i$, $k$ are permutations $s$, $t$ respectively, of
$1, \cdots, f_1;\ 1', \cdots, f_2'$, are of the form $G(st^{-1})$ and $H(st^{-1})$. We
introduce the (real) elements $\mathbf{G}$ and $\mathbf{H}$ with components $G(s)$
and $H(s)$. $\mathbf{G}$ and $\mathbf{H}$ are next replaced by $\mathbf{G}'$ and $\mathbf{H}'$ with components
$G(s)$ and $H(s)$ if $s$ is in $\pi' = \pi_1 \times \pi_2$, and 0 otherwise;
the justification for this lies in the fact that the components
associated with an $s$ which is not in $\pi'$ are very small: they are
exponentially small in $d/a$, as the estimate of the overlap
integral above indicates. $\mathbf{G}'$ is in fact the modulus, whereas $\mathbf{G}$
is not; the procedure employed previously must therefore be
modified in the following purely formal respect. On repeating
the reasoning, keeping in mind the fact that $\mathbf{G}$ is no longer the
modulus, we find as the secular equation for the determination
of the $b$ terms $\lambda = W_\alpha$

$$|\lambda G_{\alpha\beta} - H_{\alpha\beta}| = 0 \tag{11.2}$$

in which

$$e'\mathbf{G}e' = \sum_{\alpha,\beta} G_{\alpha\beta}\, e_{\alpha\beta} + \cdots, \qquad e'\mathbf{H}e' = \sum_{\alpha,\beta} H_{\alpha\beta}\, e_{\alpha\beta} + \cdots,$$

in terms of the notation employed in the preceding section.
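A secular equation of the form $|\lambda G - H| = 0$, in which the metric $G$ is not the unit form, can be solved in closed form when there are only two equivalent sub-spaces. The following is a minimal sketch for that case, assuming real symmetric 2 × 2 matrices with $G$ positive definite; the function name and the numerical entries used in the usage note are hypothetical and serve only as illustration.

```python
import math


def secular_roots_2x2(G, H):
    """Roots lambda of det(lambda*G - H) = 0 for real symmetric 2x2 matrices,
    returned in increasing order. G is assumed positive definite."""
    (g11, g12), (_, g22) = G
    (h11, h12), (_, h22) = H
    # det(lambda*G - H) = A*lambda^2 + B*lambda + C
    A = g11 * g22 - g12 * g12
    B = -(g11 * h22 + g22 * h11 - 2 * g12 * h12)
    C = h11 * h22 - h12 * h12
    disc = math.sqrt(B * B - 4 * A * C)
    return sorted(((-B - disc) / (2 * A), (-B + disc) / (2 * A)))
```

For instance, with `G = [[1.0, 0.1], [0.1, 1.0]]` and `H = [[-1.0, -0.2], [-0.2, -1.0]]` the two roots are $(-1.0 - 0.2)/(1 + 0.1)$ and $(-1.0 + 0.2)/(1 - 0.1)$: the non-orthogonality enters only through the denominators.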

This procedure is open to the criticism that whereas the
second order perturbations between the electrons of the same
atom are neglected, the interaction between the two atoms, which
is considered to be of second order, is taken into account. The
results are therefore inapplicable to the limit $d/a \to \infty$ and can
at most be applied successfully in cases in which $d/a$ is considerably
larger than 1 but not too large. On the other hand, we
could begin by assuming that the solution of the quantum
problem for the individual atoms is already known. Let the
function $\psi_1$ of the co-ordinates of the first $f_1$ electrons be a
characteristic function of the first atom corresponding to the
energy term $E_1$ (so normalized that the integral of $\bar{\psi}_1\psi_1$ is unity);
it will belong to a certain simple symmetry state of the first
atom, i.e. there exists a certain real primitive idempotent element
$e_1$ of $(\pi_1)$ such that $e_1\psi_1 = \psi_1$. Similarly, let $\psi_2$ be a characteristic
function of the second atom for the term $E_2$, having a
corresponding property $e_2\psi_2 = \psi_2$. Neglecting the interaction
between the atoms, $\psi = \psi_1 \cdot \psi_2$ is a characteristic function of
the molecule consisting of the two atoms and having the energy
$E = E_1 + E_2$. $e' = e_1 \times e_2$ is a primitive idempotent element
of the algebra of $\pi' = \pi_1 \times \pi_2$ and $\psi$ has the property

$$e'\psi = \psi.$$

The functions $s\psi$, which are obtained from $\psi$ by the totality of
$f!$ permutations $s$ of its arguments, span a linear function space
$(\mathfrak{R})$ of a finite number of dimensions, in which the $s\psi$ are naturally
neither linearly independent nor mutually orthogonal.
The theory of perturbations requires us to find those functions
$\phi$ of $(\mathfrak{R})$ which are such that the orthogonal projection of $H\phi$
on $(\mathfrak{R})$ is proportional to $\phi$ itself; the factors of proportionality
are then the values of the displaced terms, to a first approximation.
We must therefore evaluate the integrals $G(s, t)$, $H(s, t)$ of

$$\overline{t\psi} \cdot s\psi \quad \text{and} \quad \overline{t\psi} \cdot H(s\psi)$$

and solve the secular equation

$$|\lambda G(s, t) - H(s, t)| = 0.$$

$G$ and $H$ depend only on $t^{-1}s$:*

$$G(s, t) = G(t^{-1}s), \qquad H(s, t) = H(t^{-1}s).$$

This is proved by the fact that the integral of $\bar{\psi} \cdot \phi$ is unchanged
on replacing $\psi$, $\phi$ by $r\psi$, $r\phi$ ($r$ an arbitrary permutation); $H(s\psi)$
is equal to $sH\psi$ because of the symmetry of the operator $H$.
Let $\mathbf{G}$ and $\mathbf{H}$ again be the elements of $(\pi)$ with components
$G(s)$, $H(s)$. They satisfy the equations

$$e'\mathbf{G}e' = \mathbf{G}, \qquad e'\mathbf{H}e' = \mathbf{H}$$

and are therefore of character $(e', e')$. Indeed, we have, for
example,

$$\psi = \sum_r e'(r^{-1}) \cdot r\psi, \quad \text{whence} \quad H(s\psi) = \sum_r e'(r^{-1}) \cdot H(sr\psi),$$

and on multiplying this latter by $\bar{\psi}$ and integrating we find

$$H(s) = \sum_r e'(r^{-1})\, H(sr) \quad \text{or} \quad \mathbf{H} = \mathbf{H}e'.$$

It then follows that also $\tilde{\mathbf{H}} = \tilde{e}'\tilde{\mathbf{H}}$, whence, since $e'$ and $\mathbf{H}$ are real,
$\mathbf{H} = e'\mathbf{H}$ and consequently $\mathbf{H} = e'\mathbf{H}e'$ as asserted.

* On comparing this with (11.1) it is to be remembered that there the
permutations $s$ and $t$ operate on the indices and not on the arguments; hence
the elements (11.1) are, in our present notation,
$G(t^{-1}, s^{-1})$ and $H(t^{-1}, s^{-1})$.

The only non-vanishing elements of the matrix $\|H_{JK}\|$,
which corresponds to the element $\mathbf{H}$ in the representation $\mathfrak{h}$,
are (in the notation of § 10 with $e_1' = e'$) those contained in the
square sub-matrix of length $b$ in which the row and column
indices $J$ and $K$ are of the form $\alpha 1$. We are thus led directly
to the secular equation

$$|\lambda G_{\alpha\beta} - H_{\alpha\beta}| = 0$$

of degree $b$. (The most natural method of solving this equation
consists in finding that linear transformation which sends the
Hermitian form with coefficients $G_{\alpha\beta}$ into the unit form and at
the same time reduces $\|H_{\alpha\beta}\|$ to diagonal form.) $\sum_\alpha H_{\alpha\alpha}$ is then
the trace of the matrix belonging to $\mathbf{H}$ in the representation $\mathfrak{h}$,
or

$$\sum_\alpha H_{\alpha\alpha} = \sum_s H(s)\chi(s).$$

If in particular $b = 1$ the above symmetry system of the
molecule contains but a single term arising from the unperturbed
term $E$; its value is, in accordance with the equation derived
above, given by

$$\lambda = \frac{\sum_s H(s)\chi(s)}{\sum_s G(s)\chi(s)} = \frac{E + \sum_s' H(s)\chi(s)}{1 + \sum_s' G(s)\chi(s)}. \tag{11.3}$$

The accent on the right-hand side indicates that these sums are
to be extended over only those permutations $s$ which do not
belong to $\pi'$. This formula (11.3) is due to F. London. It
will be shown later that in the case of diatomic molecules $b$
is always 1; we must expect, however, to find higher values of
$b$ in dealing with more complex molecules. The real difficulty
from the physical standpoint naturally consists in getting information
concerning the exchange energies $H(s)$. It is to be
noted, however, that we need only to concern ourselves with the
sums

$$\sum_r H(rsr^{-1}), \qquad \sum_r G(rsr^{-1})$$

over the various classes, for since $\chi(s)$ is a class function all
summands in (11.3) for elements in the same class $\mathfrak{k}$ may be
added together to give the above coefficients multiplied by $\chi(\mathfrak{k})$.
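London's formula for the case $b = 1$ is easily evaluated once the sums of the exchange integrals over the classes are known. The sketch below is illustrative only: the function name, the dictionary layout and the numerical values are assumptions, and each key stands for a group of permutations outside $\pi'$ on which the character is constant.

```python
def london_term(E, exchange, chi):
    """Evaluate (E + sum' H(s) chi(s)) / (1 + sum' G(s) chi(s)).

    `exchange` maps a label for each group of permutations outside pi'
    to the pair (sum of H(s) over the group, sum of G(s) over the group);
    `chi` maps the same labels to the value of the character there."""
    numerator = E + sum(h * chi[label] for label, (h, _) in exchange.items())
    denominator = 1 + sum(g * chi[label] for label, (_, g) in exchange.items())
    return numerator / denominator


# Two one-electron atoms: pi' is the identity alone, and the only remaining
# permutation is the transposition. The numbers are made up for illustration.
exchange_h2 = {"transposition": (-0.2, 0.1)}
symmetric = london_term(-1.0, exchange_h2, {"transposition": 1})
antisymmetric = london_term(-1.0, exchange_h2, {"transposition": -1})
```

With a negative exchange integral the symmetric state lies below the unperturbed term and the anti-symmetric state above it, in line with the remark that only the former can bind the two atoms.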

Without doubt these investigations, which are as yet in their
infancy, are of fundamental importance for theoretical chemistry;
the non-polar bond is due to the exchange energies. Heisenberg
has given an explanation of ferromagnetism with the aid of
these same principles.

§ 12. The Symmetry Problem of Quantum Theory 

On taking the spin into account the components of a vector
$x(\iota i)$, which represents the state of a single electron, have two
indices $\iota$ and $i$; the first of these refers to the spin and runs from
1 to $\nu$, while the second refers to the translation and runs from
1 to $n$. Actually $\nu = 2$ and $n = \infty$ (as long as we do not restrict
ourselves to the consideration of quantum states with fixed
energy). Our vector space $\mathfrak{R}$ is accordingly $\mathfrak{R}_{\nu n} = \mathfrak{R}_\nu \times \mathfrak{R}_n$.

The state of a system consisting of $f$ electrons is now to be
represented by a tensor of order $f$ in this space:
$F(\iota_1 i_1, \cdots, \iota_f i_f)$, a "double tensor" which stands, so to speak, with
one foot (the Greek indices) in the space $\mathfrak{R}_\nu$ and the other (the
Latin indices) in $\mathfrak{R}_n$. This tensor space is completely reducible,
with respect to the algebra $\Sigma_{\nu n}$ of all symmetric transformations
of the index pairs $(\iota i)$, into irreducible invariant sub-spaces,
each of which is generated by an idempotent symmetry operator.
The Pauli exclusion principle states that only one of these
sub-spaces is physically realized; it automatically abolishes the
physically absurd existence of multiplicities which cannot be
resolved and at the same time denies the existence of absolutely
non-combining systems of terms. Furthermore, according to
Pauli this is the space $\{\mathfrak{R}^f_{\nu n}\}$ of all anti-symmetric double
tensors.

On ignoring the spin perturbation, this sub-space is to be reduced
as far as possible into sub-spaces which are invariant with respect
to the special symmetric transformations of the form

$$F'(\iota_1 i_1 \cdots \iota_f i_f) = \sum_{(k)} c(i_1 \cdots i_f;\, k_1 \cdots k_f) \cdot F(\iota_1 k_1 \cdots \iota_f k_f) \tag{12.1}$$

which do not depend on the Greek indices at all; these constitute
our old algebra $\Sigma = \Sigma_n$. This transition from $\Sigma_{\nu n}$ to $\Sigma_n$ is to
be accomplished in two steps. We first ignore the interaction
between spin and translation, but allow the translations to
interact among themselves in an arbitrary manner and similarly
the spins among themselves; we must then consider only the
symmetric transformations of the form

$$\gamma(\iota_1 \cdots \iota_f;\, \kappa_1 \cdots \kappa_f) \cdot c(i_1 \cdots i_f;\, k_1 \cdots k_f). \tag{12.2}$$

These transformations do not constitute an algebra themselves,
but they belong to their "enveloping" algebra $\Sigma_\nu \times \Sigma_n$ which
consists of all transformations whose coefficients

$$c(\iota_1 i_1 \cdots \iota_f i_f;\, \kappa_1 k_1 \cdots \kappa_f k_f)$$

are unaltered on subjecting the two rows $\iota_1 \cdots \iota_f$, $\kappa_1 \cdots \kappa_f$
of Greek indices to the same arbitrary permutation $\sigma$ and the
two rows of Latin indices to the same arbitrary permutation $s$.
The second step then consists in letting $\gamma$ in (12.2) be the identity.
The first step thus consists merely in making the permutation
of the Greek indices independent of the permutation of the Latin
indices, and the second in restricting the first of these permutations
to the identity.

In the first place, then, we introduce the elementary symmetry
operator $\sigma \times s$ which, on applying it to the double tensor
$F(\iota_1 i_1 \cdots \iota_f i_f)$, subjects the Greek indices to the permutation
$\sigma$ and the Latin to the permutation $s$. The general symmetry
operator is then an arbitrary linear combination

$$a = \sum_{\sigma, s} a(\sigma, s)(\sigma \times s)$$

of these elementary ones; we have thus to deal with the algebra
$\mathfrak{r} \times \mathfrak{r}$ of elements $x$, the components $x(\sigma, s)$ of which are functions
both of whose arguments run through the elements of the group $\pi$.
We denote the element with components $F(\sigma, s) = (\sigma \times s)F$
by $\mathbf{F}$; the equation $F' = aF$ ($F'$ the double tensor obtained
from $F$ by the operator $a$) is equivalent to $\mathbf{F}' = \mathbf{F} \cdot \hat{a}$. The
group $\pi \times \pi$ of elements $\sigma \times s$ contains $\pi$ itself as the sub-group
consisting of elements $s \times s$. So far as the first step is concerned,
our problem amounts to the following: Let $l(s)$ be the
components of a primitive idempotent element of the algebra
$\mathfrak{r} = (\pi)$; we set

$$\mathbf{l} = \sum_s l(s)(s \times s)$$

and study the elements of the form $x\mathbf{l}$ in $\mathfrak{r} \times \mathfrak{r}$. They constitute
an invariant sub-space $(\mathfrak{r} \times \mathfrak{r})_l$ which is to be reduced
into its irreducible invariant constituents; in Pauli's case we
have in particular

$$\mathbf{l} = \frac{1}{f!} \sum_s \delta_s (s \times s).$$

The procedure which it seems natural to follow is first of
all to express the modulus 1 of $\mathfrak{r}$ in any two ways as the sum of
primitive independent idempotent elements:

$$1 = \sum_i e_i', \qquad 1 = \sum_j e_j. \tag{12.3}$$

An arbitrary element $x$ of the algebra $\mathfrak{r} \times \mathfrak{r}$ is reduced into
independent constituents in accordance with the equation

$$x = \sum_{i,j} x(e_i' \times e_j) = \sum_{i,j} x_{ij}. \tag{12.4}$$

Now we know from § 10, II, that the elements of the form $x_{ij}$
constitute an irreducible invariant sub-space $\mathfrak{p}_{ij}$; consider

$$x\mathbf{l} = \sum_{i,j} x_{ij}\mathbf{l}$$

in this light. The projection $x \to y = x\mathbf{l}$ sends $\mathfrak{p}_{ij}$ over into
a certain invariant sub-space $(\mathfrak{p}_{ij})$ of $(\mathfrak{r} \times \mathfrak{r})_l$. Since those
$x$ of $\mathfrak{p}_{ij}$ for which $x\mathbf{l} = 0$ constitute an invariant sub-space of
$\mathfrak{p}_{ij}$ we have only the two typical possibilities: either $(\mathfrak{p}_{ij}) = 0$
or this projection $x \to x\mathbf{l}$ maps $\mathfrak{p}_{ij}$ in a one-to-one and similar
manner on $(\mathfrak{p}_{ij})$. The sum

$$(\mathfrak{r} \times \mathfrak{r})_l = \sum_{i,j} (\mathfrak{p}_{ij}), \tag{12.5}$$

arranged in some particular order, is such that each term can,
in virtue of its irreducibility, only either be contained in the
sum of the preceding terms or be independent of this sum. On
retaining only those terms arising from this second possibility,
$(\mathfrak{r} \times \mathfrak{r})_l$ is completely reduced into the sum of certain of the
$(\mathfrak{p}_{ij})$; the representation induced in $(\mathfrak{r} \times \mathfrak{r})_l$ by the regular
representation of the group $\pi \times \pi$ is correspondingly reduced
into its irreducible constituents of the form $\mathfrak{h}' \times \mathfrak{h}$. It will be
remembered that this symbol stands for the correspondence

$$(\sigma, s) \to U'(\sigma) \times U(s), \tag{12.6}$$

where $\mathfrak{h}'$, $\mathfrak{h}$ are the irreducible representations $\sigma \to U'(\sigma)$,
$s \to U(s)$ of $\pi$. This representation $\mathfrak{h}' \times \mathfrak{h}$ appears with a
certain multiplicity $b(\chi', \chi)$ which is determined by the number
of pairs $ij$ in (12.5) whose $e_i'$ generate the representation $\mathfrak{h}'$
and whose $e_j$ generate $\mathfrak{h}$. These considerations are of course
merely a repetition for the case at hand of the proof of theorem
(6.1).

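We now return to the space of double tensors and consider
the sub-space $\mathfrak{S}$ defined by those of the form $\mathbf{l}F$. It is the
substratum of a certain representation $\mathfrak{S}(\Sigma_\nu \times \Sigma_n)$ of $\Sigma_\nu \times \Sigma_n$,
and its complete reduction is given by the formula

$$\mathfrak{S}(\Sigma_\nu \times \Sigma_n) = \sum_{\chi', \chi} b(\chi', \chi)\,(\mathfrak{H}_\nu' \times \mathfrak{H}_n). \tag{12.7}$$

This remains correct even if $\nu$ or $n$ is less than $f$. Earlier in
this chapter we introduced the right- and left-invariant sub-space
$\mathfrak{r}_0$ of $\mathfrak{r}$ as that sub-space consisting of all elements $\mathbf{F}$ which
correspond to tensors $F$ in the $n$-dimensional vector space $\mathfrak{R}_n$.
On denoting this $\mathfrak{r}_0$, which depends on $n$ (and only for $n \geq f$
coincides with the entire $\mathfrak{r}$), by $\overset{n}{\mathfrak{r}}$ we should consider the algebra
$\overset{\nu}{\mathfrak{r}} \times \overset{n}{\mathfrak{r}}$ instead of $\mathfrak{r} \times \mathfrak{r}$. But if $e_i'$ is in $\overset{\nu}{\mathfrak{r}}$ and $e_j$ in $\overset{n}{\mathfrak{r}}$, the manifold
of elements $x(e_i' \times e_j)$ is not decreased on restricting $x$ to $\overset{\nu}{\mathfrak{r}} \times \overset{n}{\mathfrak{r}}$,
and every idempotent which is equivalent to such an $e_i'$ ($e_j$) also
belongs to $\overset{\nu}{\mathfrak{r}}$ ($\overset{n}{\mathfrak{r}}$). This shows that (12.7) remains correct under
this restriction to $\overset{\nu}{\mathfrak{r}} \times \overset{n}{\mathfrak{r}}$; the only effect is that those terms for
which $\mathfrak{H}_\nu'$ or $\mathfrak{H}_n$ is the 0-dimensional representation are illusory.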
We are now ready to take the second step: to perform the
transition from the algebra $\Sigma_\nu \times \Sigma_n$ to $\Sigma = \Sigma_n$ by taking $\gamma$ in
(12.2) as the identity. We then see immediately that the
representation $\mathfrak{S}(\Sigma)$ of $\Sigma$, whose substratum consists of the
double tensors of $\mathfrak{S}$ in the sense of equation (12.1), is completely
reduced into its irreducible constituents $\mathfrak{H}$, corresponding to
the various primitive characters $\chi$ of $\pi$, in accordance with the
equation

$$\mathfrak{S}(\Sigma) = \sum_\chi m(\chi) \cdot \mathfrak{H}.$$

The multiplicity $m(\chi)$ with which this representation $\mathfrak{H}$ occurs
is given by

$$m(\chi) = \sum_{\chi'} b(\chi', \chi)\, N_\nu(\chi'), \tag{12.8}$$

where $N_\nu(\chi')$ is the dimensionality of the representation $\mathfrak{H}_\nu'$,
and the sum is extended over all the primitive characters $\chi'$
of $\pi$. Hence on disregarding the spin perturbation we obtain
the same type of reduction into non-combining systems of
terms as before, except that the multiplicity, which was previously
equal to the dimensionality $g$ of $\chi$, is now given by (12.8).
(The spin perturbation causes weak inter-system combinations
to take place and, in addition, resolves each term of the system
$\chi$ into its $m(\chi)$ components. $m(\chi)$ is the multiplicity of the
multiplet structure. Term systems $\chi$ for which $m(\chi) = 0$ do
not appear at all.)

Our reciprocity theorem enables us to determine the constants
$b$. As mentioned before, $\pi$ is contained in $\pi \times \pi$ as the
sub-group of elements of the form $s \times s$; the algebra $\mathfrak{r} = (\pi)$
appears in $\mathfrak{r} \times \mathfrak{r}$ as the totality of algebraic elements of the form
$\sum_s a(s)(s \times s)$. The elements $xl$ of the algebra $\mathfrak{r}$ constitute an
irreducible invariant sub-space $\mathfrak{p}_l$; let the irreducible representation
of $\pi$ which is induced in this sub-space by the regular
representation be denoted by $\mathfrak{h}_l$ and its character by $\Lambda(s)$. The
space of all elements of the form $x\mathbf{l}$ in $\mathfrak{r} \times \mathfrak{r}$ is then $\langle\mathfrak{p}_l\rangle$ in the
notation of § 10; it is the substratum of the representation
$\langle\mathfrak{h}_l\rangle$ of $\pi \times \pi$. $\langle\mathfrak{h}_l\rangle$ contains the representation $\mathfrak{h}' \times \mathfrak{h}$ exactly
$b$ times; the reciprocity theorem then tells us that the number
of times the representation $\mathfrak{h}' \times \mathfrak{h}$ contains the representation
$\mathfrak{h}_l$ on restricting $\pi \times \pi$ to its sub-group $\pi$ is also $b$. Now this
restriction to $\pi$ sends the representation (12.6) of $\pi \times \pi$ into
the representation

$$(s, s) \to U'(s) \times U(s)$$

of $\pi$. This means, however, that $b(\chi', \chi)$ is the number of times
the representation $\mathfrak{h}_l$ of $\pi$ is contained in the representation $\mathfrak{h}' \times \mathfrak{h}$
of $\pi$ (no longer with boldface multiplication sign!). Hence
$b$ is expressed by the mean value

$$b(\chi', \chi) = \mathfrak{M}_s\{\chi'(s)\,\chi(s)\,\Lambda(s^{-1})\}. \tag{12.9}$$

With this we have carried our solution of the problem of determining
the multiplicities $m(\chi)$ as far as is possible in the general
case.
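The mean value which expresses $b$ in terms of the three characters can be tabulated directly for a small group. The sketch below assumes the symmetric group on three objects with hard-coded characters and takes the mean of $\chi'(s)\chi(s)\Lambda(s^{-1})$ over the group; `multiplicity` and `b_table` are hypothetical names.

```python
from fractions import Fraction
from itertools import permutations


def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


def sign(p):
    """Signature of a permutation, from its inversion count."""
    n = len(p)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n) if p[i] > p[j])
    return -1 if inversions % 2 else 1


GROUP = list(permutations(range(3)))   # assumed example: three objects

CHARACTERS = {                          # hard-coded primitive characters
    "trivial": lambda p: 1,
    "sign": sign,
    "standard": lambda p: sum(1 for i in range(3) if p[i] == i) - 1,
}


def multiplicity(chi_prime, chi, lam):
    """Mean over the group of chi'(s) * chi(s) * lam(s^-1)."""
    total = sum(Fraction(chi_prime(s) * chi(s) * lam(inverse(s))) for s in GROUP)
    return total / len(GROUP)


def b_table(lam):
    """Multiplicities b for every pair of primitive characters."""
    return {
        (name1, name2): multiplicity(c1, c2, lam)
        for name1, c1 in CHARACTERS.items()
        for name2, c2 in CHARACTERS.items()
    }


b_symmetric = b_table(lambda s: 1)   # complete symmetry: character identically 1
b_pauli = b_table(sign)              # complete anti-symmetry: character is the sign
```

In the symmetric case the table is the unit matrix; in the anti-symmetric case the non-vanishing entries pair each character with its product by the signature.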

Consider in particular the special cases (1) complete symmetry, 
fi — [5W], and (2) complete anti-symmetry, = [W ] — the 
Pauli case. For the first $\Lambda(s) = 1$. With each irreducible
representation $\chi$ is associated the contragredient representation
with character $\hat\chi(s) = \chi(s^{-1})$; if the substratum of the first
is generated by the idempotent element $e$ the substratum of
the latter is generated by $\hat e$. Or we may describe this situation
by saying that $\chi$, $\hat\chi$ are the characters of mutually
contragredient representations. (Accidentally $\chi(s^{-1}) = \chi(s)$ for the
complete symmetric group $\pi$; this does not hold for a general
permutation group, however, whereas our entire theory does.)
Equation (12.9) now becomes

$$b(\chi', \chi) = \mathfrak{M}\{\chi'(s)\,\hat\chi(s^{-1})\}.$$
But in virtue of the orthogonality property of characters this
mean value is 1 or 0 according as the representation $\chi'$ is
equivalent to $\hat\chi$ or not. The expression (12.8) for the multiplicity

then assumes the simple form

Mx) = 

The theorem that the representation $\mathfrak{h}' \times \mathfrak{h}$
contains the identical representation $\mathfrak{h}_1$ once or not at all
according as $\mathfrak{h}'$ is equivalent to the contragredient of
$\mathfrak{h}$ or not is nothing other than
the fundamental theorem [III, (10.5)] on which the entire
theory of representations was based.

In the second (anti-symmetric) case $\Lambda(s) = \delta_s$. Now

$$\chi^*(s) = \delta_s \cdot \chi(s)$$

is the character of the "dual" representation $\mathfrak{h}^*$ associated
with $\mathfrak{h}$; if $\mathfrak{h}$ is generated by the idempotent element
$e$ then $\mathfrak{h}^*$ is generated by the idempotent
$e^*(s) = \delta_s \cdot e(s)$. Or if $\mathfrak{h} : s \to U(s)$ then
$\mathfrak{h}^* : s \to \delta_s \cdot U(s)$. The expression for the
multiplicity is in this case

"*(x) = ^vix*) (12.10) 

If we denote the 1-dimensional representation $s \to \delta_s$ by {1},
the fundamental theorem mentioned above tells us immediately
that $\mathfrak{h}' \times \mathfrak{h}$ contains the representation {1} once
or not at all according as $\mathfrak{h}'$ is equivalent to
$\hat{\mathfrak{h}}^*$ or not. (12.10) is the actual
multiplet formula, for this second case is the one which is of
interest for atomic physics.
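
The rule that $b$ is the group mean of $\chi'(s)\chi(s)\Lambda(s^{-1})$ can be tried out on the smallest non-trivial case. The following is only a sketch for the symmetric group on three letters; the character table and the names `symmetric`, `antisymmetric`, `mixed` are assumptions brought in from standard theory, not taken from the text.

```python
from fractions import Fraction

# Character table of the symmetric group on 3 letters (a standard fact,
# assumed here). Classes: identity, transpositions, 3-cycles.
CLASS_SIZES = [1, 3, 2]
CHARACTERS = {
    "symmetric": [1, 1, 1],
    "antisymmetric": [1, -1, 1],   # this is delta_s
    "mixed": [2, 0, -1],
}

def mean(values):
    """Group mean of a class function given by its values on the classes."""
    total = sum(n * v for n, v in zip(CLASS_SIZES, values))
    return Fraction(total, sum(CLASS_SIZES))

def b(chi_prime, chi, lam):
    """Mean of chi'(s) chi(s) Lambda(s^-1). Every class of a symmetric
    group is closed under inversion, so Lambda(s^-1) = Lambda(s) here."""
    return mean([x * y * z for x, y, z in zip(chi_prime, chi, lam)])

def multiplicity_table(lam_name):
    """b(chi', chi) for all pairs, keyed by (name of chi', name of chi)."""
    lam = CHARACTERS[lam_name]
    return {(p, q): b(CHARACTERS[p], CHARACTERS[q], lam)
            for p in CHARACTERS for q in CHARACTERS}
```

With $\Lambda = 1$ the table is 1 on the diagonal and 0 elsewhere; with $\Lambda(s) = \delta_s$ it pairs each character with its dual, so that the mixed character is paired with itself.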

Additional Remarks.

The only cases of importance for physics, (1) that of symmetric
and (2) that of anti-symmetric double tensors, can be
handled by elementary methods. We again refrain as long as
possible from making restrictive assumptions concerning the
field over which the algebras are defined. The method will be
illustrated by application to case (1).

(12.11) If $e_1$, $e_2$ are equivalent idempotent elements, then
$\hat e_1$, $\hat e_2$ are also.

Proof. Let $\mathfrak{p}_1$ be mapped on $\mathfrak{p}_2$ by a one-to-one
similarity correspondence $\Gamma : x \to xb$; $b$ is here the element, of
character $(e_1, e_2)$, into which $e_1$ is sent by $\Gamma$. Let the inverse
correspondence carry $e_2$ over into $a$, which is then of character
$(e_2, e_1)$. $\Gamma$ carries $a$ over into $e_2$; since the element
associated with $a$ by $\Gamma$ is $ab$ we have $e_2 = ab$. Similarly, we find
with the aid of $\Gamma^{-1}$ that $e_1 = ba$. We then have

$$e_2 = ab, \quad e_1 = ba; \qquad e_2 a e_1 = a, \quad e_1 b e_2 = b.$$

Conversely, the existence of these equations guarantees that

$$x_2 = x_1 b, \qquad x_1 = x_2 a$$

are reciprocal similarity correspondences $\mathfrak{p}_1 \rightleftarrows \mathfrak{p}_2$.
That is, the existence of these four equations means that $e_1$ and $e_2$ are
equivalent. We need only to "roof" these equations, i.e. go
over to the quantities $\hat x$ associated with each of these $x$ by the
definition $\hat x(s) = x(s^{-1})$, in order
to conclude that $\hat e_1$ and $\hat e_2$ are then also equivalent. We have
here neither assumed that
the $e$ are primitive nor that the field is algebraically closed.

(12.12). The invariant sub-spaces $\mathfrak{p}$, $\hat{\mathfrak{p}}$
generated by $e$, $\hat e$ are
the substrata of mutually contragredient representations.

Proof. Let $\mathfrak{p}$ consist of all elements $xe$; we introduce in
addition to this left-invariant sub-space the right-invariant
sub-space $\mathfrak{q}$ consisting of all elements of the form $ex$. Let
$\mathrm{tr}(xy)$ be the trace of the elements $x$ and $y$, which may vary
freely in $\mathfrak{p}$, $\mathfrak{q}$, respectively; we assert that it is a
non-degenerate bilinear form. That is: if $\mathrm{tr}(ay) = 0$ identically in
$\mathfrak{q}$ then the element $a$ of $\mathfrak{p}$ must be 0, and if
$\mathrm{tr}(xb) = 0$ identically in $\mathfrak{p}$ the
element $b$ of $\mathfrak{q}$ must be 0. Indeed, if $z$ is any arbitrary element
whatever and $a$ is in $\mathfrak{p}$, then

$$az = ae \cdot z = a \cdot ez = ay,$$

where $y$ is in $\mathfrak{q}$. Hence the assumption that $\mathrm{tr}(ay) = 0$
in $\mathfrak{q}$ implies that $\mathrm{tr}(az) = 0$ for arbitrary $z$, whence
$a = 0$ [cf. § 4]. Similarly for the remaining case $\mathrm{tr}(xb) = 0$.

Now let $\mathfrak{p}$ and $\mathfrak{q}$ be referred to arbitrary co-ordinate
systems and let the co-ordinates of $x$, $y$ be $\xi_1, \xi_2, \cdots, \xi_g$;
$\eta_1, \eta_2, \cdots, \eta_h$ respectively. Then $\mathrm{tr}(xy)$ is of the form

$$\mathrm{tr}(xy) = \sum_{i,k} s_{ik}\,\xi_i\,\eta_k.$$

The theorem above shows that $g \le h$ and $h \le g$, whence $h = g$,
and that the coefficients $s_{ik}$ may be considered as the coefficients
of a non-singular linear transformation. Hence on choosing
the co-ordinate system in $\mathfrak{q}$ in an appropriate manner
$\mathrm{tr}(xy)$ may be reduced to the canonical form

$$\mathrm{tr}(xy) = \sum_{i=1}^{g} \xi_i\,\eta_i.$$

But then

$$\mathrm{tr}(xy) = \mathrm{tr}(yx) = \mathrm{tr}(yr^{-1} \cdot rx).$$

Hence the simultaneous substitution

$$x' = rx, \qquad y' = yr^{-1},$$

which does not lead out of $\mathfrak{p}$, $\mathfrak{q}$ respectively, leaves
the trace invariant. These two transformations are therefore
contragredient in the new co-ordinate systems; our assertion (12.12)
then follows immediately on writing the second of these equations
in the "roofed" form $\hat y' = r\hat y$ and noting that $\hat y$ runs through
the left-invariant sub-space $\hat{\mathfrak{p}}$ generated by $\hat e$ as $y$
runs through $\mathfrak{q}$.

After this preliminary skirmish we apply the method employed
before, somewhat modified, to the case (1) in which $\Lambda(s) = 1$.
We are now interested in the reduction (12.4) only for symmetric
elements, i.e. elements which satisfy the equations

$$x(\sigma r, sr) = x(\sigma, s) \tag{12.13}$$

for all $r$. This amounts to replacing $x$ by $xl$; we subsequently
note that $xl(e' \times e)$ is not symmetric and accordingly multiply
again on the right by $l$. We thus replace $e' \times e$ by $l(e' \times e)l$
rather than $(e' \times e)l$ and proceed to obtain an explicit expression
for the reduction, rather than calling on the aid of the reciprocity
theorem. First, the components of $l(e' \times e)$ are (on ignoring
the factor $1/f!$) given by

$$\sum_r e'(r\sigma)\,e(rs) = \sum_r \hat e(s^{-1}r^{-1})\,e'(r\sigma) = \hat e e'(s^{-1}\sigma).$$

This expression vanishes if $\hat e e' = 0$; for $e' = \hat e$ we find it is
equal to $\hat e(s^{-1}\sigma) = e(\sigma^{-1}s)$. This suggests that we choose

$$1 = \sum_i \hat e_i, \qquad 1 = \sum_i e_i$$

as the two complete reductions (12.3) of the modulus 1. The
only terms in the sum (12.4) which then remain for symmetric
$x = xl$ are those of the form $x(\hat e_i \times e_i)$, and the factor
$l(\hat e_i \times e_i)$ is the element with components $e_i(\sigma^{-1}s)$.
Since $x(\hat e_i \times e_i)$ has
not been reduced identically to 0 on restricting to the domain
of symmetric elements, the sub-space which it generates is
here, as before, equivalent to the irreducible
$\hat{\mathfrak{p}}_i \times \mathfrak{p}_i$. The
next step consists in multiplying on the right with $l$, whereby
$e(\sigma^{-1}s)$ becomes, in accordance with (8.3) and (7.22),

$$\frac{1}{f!}\sum_r e(r^{-1}\sigma^{-1}sr) = \frac{1}{f!}\,\chi(\sigma^{-1}s).$$

Our final result is that any symmetric $x$ can be reduced in accordance with

$$x = x\varepsilon' + x\varepsilon'' + \cdots, \quad \text{where} \quad \varepsilon(\sigma, s) = \frac{g}{(f!)^2}\,\chi(\sigma^{-1}s); \tag{12.14}$$

in deriving this result it is to be remembered that the number
of times any irreducible representation appears in the regular
one is given by its dimensionality.

It follows from the fact that $\varepsilon(s)$ is a class function that these
elements $\varepsilon', \varepsilon'', \cdots$ constitute a set of independent
idempotent elements in $\rho \times \rho$. This result is in fact obtainable by
direct methods and is valid, regardless of whether the field in which
we are operating is algebraically closed or not. To show this
we note that any "symmetric" element $x(\sigma, s)$ is a function
only of $s\sigma^{-1}$ in virtue of (12.13): $x(\sigma, s) = x(s\sigma^{-1})$.
Thus there exists a one-to-one correspondence between the symmetric
elements of $\rho \times \rho$, the space of which we denote by
$[\mathfrak{r} \times \mathfrak{r}]$, and the elements of $\mathfrak{r}$. Direct
computation shows that this
correspondence associates with each left-invariant sub-space of
$[\mathfrak{r} \times \mathfrak{r}]$ a left- and right-invariant sub-space of
$\mathfrak{r}$, and conversely;
the reduction of $[\mathfrak{r} \times \mathfrak{r}]$ into left-invariant
sub-spaces thus parallels the reduction of $\mathfrak{r}$ into sub-spaces which
are both left- and right-invariant. The whole problem is thus much simpler
for $[\mathfrak{r} \times \mathfrak{r}]$ than for $\mathfrak{r}$ itself; its
solution is obtained by carrying over the equation

$$x = x\varepsilon' + x\varepsilon'' + \cdots \tag{7.5}$$

for the algebra $\rho$ to $[\mathfrak{r} \times \mathfrak{r}]$, the result of
which is (12.14).
Nevertheless we must return to the previous less elementary
analysis in order to see (and this result presupposes that the
field is algebraically closed) that each of the irreducible invariant
sub-spaces of $[\mathfrak{r} \times \mathfrak{r}]$ obtained in this way is
equivalent to a sub-space of the algebra $\mathfrak{r} \times \mathfrak{r}$ of
the form $\hat{\mathfrak{p}} \times \mathfrak{p}$ (where
$\mathfrak{p}$ and $\hat{\mathfrak{p}}$ are irreducible invariant sub-spaces of
$\mathfrak{r}$ with generating units $e$ and $\hat e$).

The completely anti-symmetric case can be dealt with in a
corresponding elementary way.

The complete reduction of the manifold of tensors in the
2-dimensional spin space, $\nu = 2$, is accomplished with the
aid of the Clebsch-Gordan formula [III, (5.9)]. $(\mathfrak{c})^f$ is
$\mathfrak{C}_1 \times \mathfrak{C}_1 \times \cdots \times \mathfrak{C}_1$
($f$ factors), where $\mathfrak{C}_1$ is the representation of the linear
group $\mathfrak{c} = \mathfrak{c}_2$ by itself, and by the formula mentioned
above this representation is completely reducible into the irreducible
$\mathfrak{C}_v$, where $v$ can assume only the values $f, f - 2, f - 4, \cdots$. The
dimensionality of $\mathfrak{C}_v$ is $v + 1$, and to each of these possible
dimensionalities there corresponds here but one irreducible
representation. Formula (12.10) then tells us that there exists
only one term system having the multiplicity $v + 1$ ($= f + 1,
f - 1, f - 3, \cdots$); compare the beginning of § 15 on this point.
The preceding analysis seems to me to be necessary in order
to obtain a complete understanding of the relations implied by
the permutation group without recourse to the approximation
characteristic of the theory of perturbations. So far as the
latter is concerned we proceed as follows. Again consider a
term of the form (8.4) of the unperturbed system, the only
degeneracy of which is that necessitated by the equality of
the $f$ electrons. The perturbation equation is then

$$F(\iota_1 i_1, \cdots, \iota_f i_f) = \sum_t a(st^{-1})\,F(\iota_1 k_1, \cdots, \iota_f k_f), \tag{12.15}$$

where the $a(s)$ are the exchange energies and $i_1 \cdots i_f$, $k_1 \cdots k_f$
are obtained from $1 \cdots f$ by the permutations $s$, $t$ respectively.
Let $\phi$ be the tensor in spin space defined by

$$F(\iota_1 1, \iota_2 2, \cdots, \iota_f f) = \phi(\iota_1 \iota_2 \cdots \iota_f);$$

the anti-symmetry of the double tensor $F$ then tells us that

$$F(\iota_1 i_1, \cdots, \iota_f i_f) = \delta_s \cdot s^{-1}\phi(\iota_1 \cdots \iota_f),$$

and on letting $a'(s) = \delta_s \cdot a(s)$, (12.15) becomes

$$\phi = a'\phi. \tag{12.16}$$

The problem is thus reduced to that of finding the characteristic
numbers of this linear correspondence in the $2^f$-dimensional
space of the spin tensors.

Let $\psi_i(P)$ be the characteristic functions of the single electron.
If the perturbation is due solely to the Coulomb forces between
the various electrons, that part of the energy matrix
$a(i_1 \cdots i_f;\ k_1 \cdots k_f)$ which is due to the perturbation is
obtained additively from terms of the form

$$\int \cdots \int \frac{\bar\psi_{i_1}(P_1) \cdots \bar\psi_{i_f}(P_f) \cdot \psi_{k_1}(P_1) \cdots \psi_{k_f}(P_f)}{r_{\alpha\beta}}\, dV_1 \cdots dV_f,$$

where $\alpha \ne \beta$ and the denominator is the distance between the
two points $P_\alpha$ and $P_\beta$. The orthogonality of the $\psi$ tells us that
this integral can be non-vanishing only if the permutation $s$,
which sends the set of indices $k$ into the set $i$ (both of which
are permutations of $1, 2, \cdots, f$), is either the identity or the
transposition $(\alpha\beta)$. In this latter case we find

$$a(s) = E_{\alpha\beta} = \iint \frac{\bar\psi_\alpha(P)\,\psi_\beta(P)\,\psi_\alpha(P')\,\bar\psi_\beta(P')}{r_{PP'}}\, dV\, dV'.$$

On the right-hand side of (12.16) we then have only the terms
arising from $s = 1$ and the transpositions $s = (\alpha\beta)$:

$$\phi = \Big\{a(1) - \sum_{\alpha < \beta} E_{\alpha\beta} \cdot (\alpha\beta)\Big\}\,\phi. \tag{12.17}$$

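Dirac has given a remarkable formula for the transposition
acting on a spin tensor. Let $\mathfrak{S}^\alpha$ be the spin of the
$\alpha$-th electron; $S_x^\alpha$, $S_y^\alpha$, $S_z^\alpha$ are then the operators

$$\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad
\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad
\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$$

acting on the $\alpha$-th index of the tensor $\phi(\iota_1 \cdots \iota_f)$. On
calculating in particular

$$(\mathfrak{S}^1 \mathfrak{S}^2) = S_x^1 S_x^2 + S_y^1 S_y^2 + S_z^1 S_z^2$$

(which should perhaps be written $(\mathfrak{S}^1 \times \mathfrak{S}^2)$ instead,
since $\mathfrak{S}^1$ affects only the first index and $\mathfrak{S}^2$ only the
second), we find that it is the operator

$$\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 2 & 0 \\ 0 & 2 & -1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}
\qquad \text{(rows and columns in the order 00, 10, 01, 11)}$$

acting on the first two indices, all other places being 0. Hence
$\tfrac{1}{2}\{1 + (\mathfrak{S}^1 \mathfrak{S}^2)\}$ is the substitution

$$\phi(00) \to \phi(00), \quad \phi(11) \to \phi(11); \qquad \phi(10) \to \phi(01), \quad \phi(01) \to \phi(10)$$

or the transposition of the first two indices. The energy (12.17)
may then be written in the form

$$H = E_0 - \tfrac{1}{2} \sum_{\alpha < \beta} E_{\alpha\beta}\,(\mathfrak{S}^\alpha \mathfrak{S}^\beta). \tag{12.18}$$

This may be interpreted as saying that the coupling between
the electrons $\alpha$ and $\beta$ is responsible for the term
$-\tfrac{1}{2} E_{\alpha\beta}(\mathfrak{S}^\alpha \mathfrak{S}^\beta)$
in the energy operator. However, the constant $E_0$ does not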
represent the energy of the unperturbed system.'® 
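
Dirac's formula for two spin indices can be checked by direct matrix arithmetic. The sketch below uses plain nested lists for the three 2 x 2 spin matrices; the helper names (`kron`, `spin_product`, `exchange_operator`) and the basis order 00, 01, 10, 11 are choices made here for illustration only.

```python
# The three spin matrices as plain nested lists (complex entries where needed).
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]

def kron(a, b):
    """Kronecker product: a acts on the first index, b on the second."""
    return [[x * y for x in row_a for y in row_b]
            for row_a in a for row_b in b]

def add(a, b):
    """Entry-wise sum of two matrices of equal shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(k, a):
    """Multiply every entry of a by the number k."""
    return [[k * x for x in row] for row in a]

IDENTITY4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]

def spin_product():
    """Sx x Sx + Sy x Sy + Sz x Sz on the basis 00, 01, 10, 11."""
    return add(add(kron(SX, SX), kron(SY, SY)), kron(SZ, SZ))

def exchange_operator():
    """One half of (identity + spin product)."""
    return scale(0.5, add(IDENTITY4, spin_product()))

# The transposition of the two indices: 00 -> 00, 01 -> 10, 10 -> 01, 11 -> 11.
SWAP = [[1, 0, 0, 0], [0, 0, 1, 0], [0, 1, 0, 0], [0, 0, 0, 1]]
```

Comparing `exchange_operator()` with `SWAP` entry by entry confirms that the operator interchanges the components 01 and 10 and leaves 00 and 11 fixed.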
C. Explicit Algebraic Construction

§ 13. Young’s Symmetry Operators

We now supplement the general theory developed above
by an explicit algebraic construction of the irreducible
representations of the symmetric permutation group $\pi = \pi_f$. This
problem is, as we know, equivalent to that of constructing the
primitive symmetry classes of tensors of order $f$ by means of
idempotent symmetry operators $e$; here a "primitive" symmetry
class is one such that the symmetry of the tensors belonging
to it cannot be further increased by the addition of
further symmetry conditions: such an additional condition
either reproduces all the tensors of the class or reduces them all
to 0. This construction is due to A. Young and G. Frobenius;
with its help we are able to verify step by step the entire theory
of representations of the symmetric group in an explicit and
elementary manner.

We are already acquainted with two very simple processes
which yield tensors of maximum symmetry: "symmetrization,"
by means of which the tensor $F$ yields the completely symmetric
tensor $\sum_s sF$, and "alternation," which sends $F$ into
$\sum_s \delta_s \cdot sF$.
The first of these processes can be readily generalized as follows:
We divide the "variables" $i_1 i_2 \cdots i_f$, each of which ranges
from 1 to $n$ and on which the general tensor component
$F(i_1 i_2 \cdots i_f)$ depends
(or, what amounts to the same, the sub-indices $1, 2, \cdots, f$),
into sub-sets of lengths $f_1, f_2, \cdots$; $f_1 + f_2 + \cdots = f$. We then
symmetrize with respect to the indices of each of these sub-sets.


[Figure: Pattern 7, 5, 4, 4, 1.]

This distribution into sub-sets may be readily visualized with
the aid of a "pattern" $P = P(f_1, f_2, \cdots)$ as illustrated in the
accompanying figure [for the pattern P(7, 6, 4, 4, 1)]; each of
the $f$ squares in the pattern is occupied by a different one of the
$f$ integers $1, 2, \cdots, f$. Each of the sub-sets mentioned above
constitutes a horizontal row of the pattern, and the various rows
are arranged one under another. The individual sub-sets may
be arranged in order of decreasing length: $f_1 \ge f_2 \ge \cdots$; the
pattern then consists of non-interrupted vertical columns as
well as non-interrupted horizontal rows. Those permutations
$p$ which permute the members of each row among themselves
constitute a sub-group $(p)$ of $\pi$ of order $f_1!\,f_2! \cdots$ [denoted in § 8
by $\pi(f_1, f_2, \cdots)$]. The symmetry operator described above, and
which is to be applied to an arbitrary tensor, is

$$a = \sum_p p;$$

henceforth $p$ will always denote an arbitrary permutation which
sends no numeral of one row into another row.

So far we have made no use of the process of alternation. 
If after having symmetrized with the aid of the operator a we 
alternate with respect to certain of the variables or sub-indices
$1, 2, \cdots, f$, we certainly obtain 0 if any two of these numerals
are in the same row, for the tensor obtained by the symmetriza- 
tion is symmetric with respect to any two such numerals and 
the result of subsequently alternating with respect to them must 
be 0. To avoid this situation we choose one variable in each of 
the rows and alternate with respect to them ; since the order 
of the variables in each row is so far immaterial we may place 
these chosen variables in the first column. We then disregard 
the first column and proceed to alternate with respect to a set of 
variables obtained by selecting one from each row of the re- 
mainder of the pattern ; these variables may now be shifted into 
the second column. This process is continued until we have 
covered the entire pattern ; the result is that we have symmetrized 
with respect to the rows and have followed this symmetrization by 
alternation with respect to the columns. Let $q$ denote an arbitrary
permutation which permutes the variables in each column among
themselves; these $q$ constitute a certain sub-group $(q)$ of $\pi$.
The alternation described above consists in applying the symmetry
operator

$$b = \sum_q \delta_q \cdot q,$$

and the entire process consists in applying the resultant operator

$$c = ba = \sum_{p,q} \delta_q \cdot qp.$$

We call $c$ the Young symmetry operator belonging to the
pattern $P$.

In order to obtain a unique symmetry operator $c$ associated
with a given pattern $P$ we must specify the way in which the
numerals from 1 to $f$ are to be distributed in $P$; they shall be
introduced in such a way that on reading the pattern, as one
would read a page of a book, they appear in their natural order
$1, 2, \cdots, f$. If we write them in any other order, say that
obtained from the standard form with the aid of the permutation $r$,
we obtain a "conjugate" element $c_r$ which, as is readily seen
on considering the relation between the tensors generated by
these two operators, is related to $c$ by

$$c_r = rcr^{-1} \quad \text{or} \quad c_r(s) = c(r^{-1}sr).$$

Hence the introduction of $r$ results merely in a new name.

From now on we operate with symmetry quantities, i.e.
elements of the algebra $(\pi)$, instead of tensors; we consider the
invariant sub-space $\mathfrak{p}_c$ of $\mathfrak{r}$ consisting of all elements
of the form $y = xc$ and the representation $\mathfrak{h}_c$ of $\pi$ induced in it
by the regular representation. With $\mathfrak{p}_c$ is associated the symmetry class
of all tensors of the form $cF$. If we replace $c$ by one of its
conjugates $c_r$ we obtain instead of $\mathfrak{p}_c$ an equivalent invariant
sub-space; in this sense the order in which the variables are written
in the pattern is quite immaterial. We hope that $\mathfrak{p}_c$ is
irreducible and that the totality of representations $\mathfrak{h}_c$ associated
with all possible patterns constitutes a complete set of inequivalent
irreducible representations of $\pi$. This hope is strengthened
by the fact that the total number of patterns is just equal
to the number of inequivalent irreducible representations. To
show this we note that the number of patterns is equal to the
number of partitions of $f$ into integral non-negative summands
$f = f_1 + f_2 + \cdots$ which satisfy the condition $f_1 \ge f_2 \ge \cdots$.
On writing

$$f_1 - f_2 = r_1, \quad f_2 - f_3 = r_2, \quad \cdots$$

we see that this number is equal to the number of solutions of
the equation

$$1r_1 + 2r_2 + 3r_3 + \cdots = f$$

for non-negative integral $r$. But we have already seen that this
is the number of classes of conjugate elements in $\pi$ and, by the
general theory, is therefore equal to the number of inequivalent
irreducible representations of $\pi$.
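
The counting argument can be tried on small $f$. The following sketch enumerates the patterns, computes the differences $r_k$, and counts the classes of conjugate elements by brute force; it relies on the standard fact, assumed here, that two permutations are conjugate exactly when they have the same cycle lengths. All function names are illustrative.

```python
from itertools import permutations

def partitions(f, largest=None):
    """All patterns f1 >= f2 >= ... > 0 with f1 + f2 + ... = f."""
    if largest is None:
        largest = f
    if f == 0:
        return [()]
    result = []
    for first in range(min(f, largest), 0, -1):
        for rest in partitions(f - first, first):
            result.append((first,) + rest)
    return result

def cycle_type(perm):
    """Cycle lengths (in decreasing order) of a permutation given as a tuple of images."""
    seen, lengths = set(), []
    for start in range(len(perm)):
        if start in seen:
            continue
        length, i = 0, start
        while i not in seen:
            seen.add(i)
            i = perm[i]
            length += 1
        lengths.append(length)
    return tuple(sorted(lengths, reverse=True))

def number_of_classes(f):
    """Classes of conjugate elements, counted by distinct cycle types."""
    return len({cycle_type(p) for p in permutations(range(f))})

def step_counts(pattern):
    """r_k = f_k - f_(k+1), so that 1*r_1 + 2*r_2 + ... = f."""
    padded = list(pattern) + [0]
    return [padded[k] - padded[k + 1] for k in range(len(pattern))]
```

For $f = 4$ both `len(partitions(4))` and `number_of_classes(4)` give 5, and for every pattern the numbers returned by `step_counts` satisfy the equation above.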

If the dimensionality $n$ of the vector space is less than $f$
the only non-vanishing symmetry classes are those arising from
patterns containing at most $n$ rows, for if the first column is
longer than $n$ alternation with respect to the variables standing
in it alone causes an arbitrary tensor to go over into 0. The
only patterns which we need in this case are consequently those
obtainable from the algebra $\mathfrak{r}_0$, instead of $\mathfrak{r}$, where
$\mathfrak{r}_0$ is as defined in § 2 above. The number of inequivalent irreducible
invariant sub-spaces into which the tensor space can be
reduced is accordingly decreased to the number of partitions
of $f$ into $n$ integral summands $f = f_1 + f_2 + \cdots + f_n$ for which
$f_1 \ge f_2 \ge \cdots \ge f_n \ge 0$.

A permutation $s = qp$ which is obtained by composition
from a permutation $p$ of $(p)$ and a permutation $q$ of $(q)$ can be
so obtained in only one way. This is an immediate consequence
of the remark that the equation $qp = 1$ can be fulfilled only by
$p = 1$, $q = 1$, for it asserts that $p = q^{-1}$ belongs to $(p)$ as well
as to $(q)$. The components of the symmetry operator $c$ can
therefore be described as follows: $c(s) = 0$ unless $s$ belongs to
the set $(q)(p)$; when $s$ belongs to this set $c(s) = \pm 1$ according
as the unique decomposition $s = qp$ yields an even or an odd
permutation $q$.
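
The description of the components $c(s)$ translates directly into a small computation. The sketch below is only an illustration: permutations are tuples of images on the numerals $0, \cdots, f-1$ (a relabelling of $1, \cdots, f$ chosen here), the product $qp$ means "first $p$, then $q$", and the function names are hypothetical.

```python
from itertools import permutations

def compose(s, t):
    """Product st of permutations (tuples of images): apply t first, then s."""
    return tuple(s[i] for i in t)

def sign(perm):
    """+1 for an even, -1 for an odd permutation (parity of the inversion count)."""
    n = len(perm)
    inversions = sum(perm[i] > perm[j] for i in range(n) for j in range(i + 1, n))
    return -1 if inversions % 2 else 1

def rows_and_columns(pattern):
    """Fill the pattern with 0..f-1 in reading order; return its rows and columns."""
    rows, start = [], 0
    for length in pattern:
        rows.append(list(range(start, start + length)))
        start += length
    columns = [[row[j] for row in rows if j < len(row)] for j in range(pattern[0])]
    return rows, columns

def stabilizer(blocks, f):
    """Permutations that permute the members of each block among themselves."""
    block_of = {i: k for k, block in enumerate(blocks) for i in block}
    return [s for s in permutations(range(f))
            if all(block_of[s[i]] == block_of[i] for i in range(f))]

def young_operator(pattern):
    """Components c(s): the sign of q when s = qp, absent (zero) otherwise."""
    f = sum(pattern)
    rows, columns = rows_and_columns(pattern)
    c = {}
    for q in stabilizer(columns, f):
        for p in stabilizer(rows, f):
            s = compose(q, p)
            assert s not in c          # the decomposition s = qp is unique
            c[s] = sign(q)
    return c
```

For the pattern with rows of lengths 2, 1 the operator has four non-vanishing components, two equal to $+1$ and two equal to $-1$; the internal assertion fails if any $s$ were reached by two different pairs $q$, $p$.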

We must now prove the following three assertions concerning $c$:

(1) $c$ is essentially idempotent; or, more precisely, $c$ satisfies
an equation $cc = \gamma \cdot c$, where $\gamma$ is a non-vanishing numerical
factor. Furthermore, $\gamma$ is an integral positive number which
is a factor of $f!$. Then replacing $c$ by $e = c/\gamma$, $e$ is idempotent.

(2) The sub-space $\mathfrak{p}_c$ is irreducible, the $e$ introduced in (1) is
primitive.

(3) Different patterns lead to inequivalent sub-spaces $\mathfrak{p}_c$.

The execution of this programme depends upon a simple
combinatorial auxiliary theorem, which we now proceed to
develop. Denote the lengths of the columns in the pattern
$P$ with rows of lengths $f_1, f_2, \cdots$ by $f_1^*, f_2^*, \cdots$:

$$f_1 + f_2 + \cdots = f_1^* + f_2^* + \cdots = f.$$

We think of the pattern $P$ as cut out of a rectangular chess-board
consisting of $f_1^*$ horizontal rows and $f_1$ vertical columns,
and the permutation $s$ as operating on $f$ chess-men occupying
the $f$ fields. On interchanging rows and columns in $P$ we obtain
the dual or transposed pattern $P^*$.

Auxiliary Theorem. A permutation $s$ belongs to $(qp)$ if and
only if any two pieces originally in the same row are not sent into
the same column by $s$.

Proof. It is evident that this condition is necessary in
order that $s$ belong to $(qp)$. The change of position which one
of the pieces suffers as a result of $s$ can be accomplished in two
moves, a horizontal and a vertical move (in this order). It
is at first conceivable that the horizontal move could send the
piece into a field of the original board which is not contained in
the pattern $P$. If the decomposition $s = qp$ is possible $p$ must
represent the horizontal move and $q$ the subsequent vertical
one; it is clear that $q$ and $p$ are thus uniquely determined.
Now if $s$ satisfies the conditions enunciated in the above theorem
the horizontal move can never throw two pieces into the same column,
i.e. the same field. It only remains to show that the horizontal
move can never send any piece out of the pattern proper, or:
those pieces which $s$ sends into a column of length $f^*$ come from the
first $f^*$ rows of the pattern. We divide the chess-board horizontally
into an upper and a lower part, the upper consisting of the
first $f^*$ rows. The pieces which are sent into the first column
by $s$ are, by assumption, from $f_1^*$ different rows; hence there
are at least (and therefore exactly) $f_1^* - f^*$ of them which come
from the lower part of the board and not from the first $f^*$ rows.
Note that $f_1^* - f^*$ is exactly the number of fields in the first
column which lie in the lower part of the board. On applying
this argument to each column in succession we find that the
number of pieces which $s$ sends into those columns which
protrude into the lower part of the board is exactly equal to the
number of fields in this part of the board. Hence all the pieces
in the lower part of the pattern are sent into columns whose
lengths are greater than $f^*$, and the only pieces $s$ sends into a
column of length $f^*$ come from the upper part of the board.

This auxiliary theorem allows us to assert that if $s$ does not
belong to $(qp)$ then there exist two pieces in a single row which
are sent into the same column by $s$. If $u$ denotes the
transposition of the two pieces in their initial positions and $v$ their
transposition in the final then $su = vs$; here $u$ belongs to $(p)$
and $v$ to $(q)$.
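
The auxiliary theorem lends itself to a brute-force check on small patterns. In the sketch below, which rests on conventions chosen here, the fields are numbered $0, \cdots, f-1$ in reading order, the piece on field $i$ is moved by $s$ to field $s(i)$, and $qp$ means "first the horizontal move $p$, then the vertical move $q$"; the function names are illustrative.

```python
from itertools import permutations

def compose(s, t):
    """Product st of permutations (tuples of images): apply t first, then s."""
    return tuple(s[i] for i in t)

def rows_and_columns(pattern):
    """Fill the pattern with 0..f-1 in reading order; return its rows and columns."""
    rows, start = [], 0
    for length in pattern:
        rows.append(list(range(start, start + length)))
        start += length
    columns = [[row[j] for row in rows if j < len(row)] for j in range(pattern[0])]
    return rows, columns

def stabilizer(blocks, f):
    """Permutations that permute the members of each block among themselves."""
    block_of = {i: k for k, block in enumerate(blocks) for i in block}
    return [s for s in permutations(range(f))
            if all(block_of[s[i]] == block_of[i] for i in range(f))]

def products_qp(pattern):
    """The set (qp): all products of a column permutation q with a row permutation p."""
    f = sum(pattern)
    rows, columns = rows_and_columns(pattern)
    return {compose(q, p) for q in stabilizer(columns, f) for p in stabilizer(rows, f)}

def keeps_row_mates_apart(s, pattern):
    """True if no two pieces from one row are sent by s into the same column."""
    rows, columns = rows_and_columns(pattern)
    column_of = {i: k for k, col in enumerate(columns) for i in col}
    return all(len({column_of[s[i]] for i in row}) == len(row) for row in rows)
```

For each pattern tried, the permutations passing `keeps_row_mates_apart` are exactly those returned by `products_qp`, which is the content of the theorem.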

§ 14. Irreducibility, Linear Independence, Inequivalence,
and Completeness

We now examine the Young symmetry operators $c$ associated
with the various patterns. Obviously

$$c(sp) = c(s), \qquad c(qs) = \delta_q \cdot c(s), \tag{14.1}$$

where $p$, $q$ are, as usual, elements of $(p)$, $(q)$, respectively.

Theorem (14.2). Any element $a$ of $(\pi)$ which satisfies equations
(14.1):

$$a(sp) = a(s), \qquad a(qs) = \delta_q \cdot a(s), \tag{14.3}$$

is a multiple of $c$.
To prove this theorem we first note that (14.3) implies

$$a(qp) = \delta_q \cdot a(1);$$

on setting $a(1) = \lambda$ the equation

$$a(s) = \lambda \cdot c(s),$$

which is to be proved, is certainly correct for all group elements
$s$ of the form $qp$. We must next show that $a(s) = 0$ if $s$ does
not belong to the set $(qp)$. Such an $s$ implies that there exist
transpositions $u$ and $v$, lying in $(p)$ and $(q)$ respectively, for
which $su = vs$. But then by (14.3)

$$a(su) = a(s), \qquad a(vs) = \delta_v \cdot a(s) = -a(s),$$

whence $a(s) = -a(s)$ or $a(s) = 0$.

Theorem (14.4). Every element of $(\pi)$ of the form $cxc$ is a
multiple of $c$.

It was shown in the general theory that this theorem is
valid if $c$ is a primitive idempotent element of $(\pi)$ and if
the field in which we operate is algebraically closed; here we
approach it from the opposite direction, as we wish to show
directly that it holds for $c$ in order to prove that $c$ is primitive.
Now obviously any element of the form $xc$ satisfies the first of
equations (14.3) and any element $cx$ the second; hence any
element of the form $cxc$ has both properties and is consequently
a multiple of $c$.

Theorem (14.5). $cc = \gamma c$ and $\gamma$ is a positive integer which
is contained in $f!$.

That $cc$ is a multiple of $c$ follows immediately from the
previous theorem; $\gamma$ is therefore the number

$$\gamma = \sum_{tt' = 1} c(t)\,c(t') = \sum_s c(s) \cdot c(s^{-1}).$$

Let the sub-space of elements of the form $xc$ be of dimensionality $g$.
The projection

$$x \to y = xc \tag{14.6}$$

projects any element $x$ into an element lying in this sub-space
and is, within $\mathfrak{p}_c$ itself, merely the multiplication $y = \gamma x$. Its
trace is therefore $\gamma g$; to see this we need merely to adapt the
co-ordinate system in group space to the sub-space $\mathfrak{p}_c$. On
the other hand its trace is immediately obtainable from (14.6) or

$$y(s) = \sum_t x(t)\,c(t^{-1}s);$$

it is $f!\,c(1) = f!$, hence

$$\gamma g = f!.$$
Consider the meaning of this fact that $\gamma$ is positive, i.e. that
$c(s)\,c(s^{-1})$ is oftener positive than negative!
$e = c/\gamma$ is idempotent; hence the character of the representation
$\mathfrak{h}_c$ induced in $\mathfrak{p}_c$ by the regular representation is
by (8.3)

= (14.7) 

y !• 

We obtain as a by-product the fact that the dimensionality $g$
of the representation $\mathfrak{h}_c$ is a factor of $f!$.
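
The relation between $\gamma$, $g$ and $f!$ can be verified numerically for small patterns. The sketch below builds $c$ from its definition, computes $\gamma = \sum_s c(s)c(s^{-1})$, and obtains $g$ as the rank of the correspondence $x \to xc$ by exact elimination. The conventions (numerals $0, \cdots, f-1$, product $st$ meaning "first $t$, then $s$") and all function names are assumptions made for this illustration.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial

def compose(s, t):
    """Product st of permutations (tuples of images): apply t first, then s."""
    return tuple(s[i] for i in t)

def inverse(s):
    inv = [0] * len(s)
    for i, j in enumerate(s):
        inv[j] = i
    return tuple(inv)

def sign(s):
    n = len(s)
    return -1 if sum(s[i] > s[j] for i in range(n) for j in range(i + 1, n)) % 2 else 1

def stabilizer(blocks, f):
    block_of = {i: k for k, block in enumerate(blocks) for i in block}
    return [s for s in permutations(range(f))
            if all(block_of[s[i]] == block_of[i] for i in range(f))]

def young_operator(pattern):
    """c = ba: components delta_q on the products qp, zero elsewhere."""
    f = sum(pattern)
    rows, start = [], 0
    for length in pattern:
        rows.append(list(range(start, start + length)))
        start += length
    columns = [[r[j] for r in rows if j < len(r)] for j in range(pattern[0])]
    return {compose(q, p): sign(q)
            for q in stabilizer(columns, f) for p in stabilizer(rows, f)}

def multiply(x, y):
    """Product in the group algebra: (xy)(s) = sum over tt' = s of x(t) y(t')."""
    out = {}
    for t, a in x.items():
        for u, b in y.items():
            s = compose(t, u)
            out[s] = out.get(s, 0) + a * b
    return {s: v for s, v in out.items() if v != 0}

def gamma(c):
    return sum(v * c.get(inverse(s), 0) for s, v in c.items())

def dimension(c, f):
    """g = dimension of the sub-space of all xc = rank of x -> xc."""
    group = list(permutations(range(f)))
    rows = []
    for t in group:
        image = multiply({t: 1}, c)
        rows.append([Fraction(image.get(s, 0)) for s in group])
    rank = 0
    for col in range(len(group)):
        pivot = next((i for i in range(rank, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        for i in range(len(rows)):
            if i != rank and rows[i][col] != 0:
                factor = rows[i][col] / rows[rank][col]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[rank])]
        rank += 1
    return rank
```

For the pattern with rows 2, 1 one finds $\gamma = 3$ and $g = 2$, in agreement with $\gamma g = 3! = 6$.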

Theorem (14.8). $\mathfrak{p}_c$ is irreducible.

We know already that this theorem is a consequence of (14.4),
but it may be instructive to prove it directly as follows. Let
$e = c/\gamma$ be reduced into two independent idempotent elements
$e_1 + e_2$; then

$$ee_1 = e_1e = e_1, \quad \text{whence} \quad ee_1e = e_1.$$

Now by theorem (14.4) any element of the form $ee_1e$ is a multiple
of $e$; hence $e_1 = \lambda e$. $e_1e_1 = e_1$ then yields the equation
$\lambda^2 = \lambda$ for the number $\lambda$. Consequently either $\lambda = 1$
or $\lambda = 0$, i.e. either $e_1 = e$ or $e_1 = 0$.

We shall say that the pattern $P'$ with rows of lengths
$f_1', f_2', \cdots$ is higher than $P$ if the first non-vanishing difference
$f_1' - f_1, f_2' - f_2, \cdots$ is positive.

Theorem (14.9). If the pattern $P'$ is higher than $P$ then
$c'c = 0$.

We do not here assume that the variables are written in
the patterns $P$, $P'$ in the normal form agreed upon in the previous
section, i.e. in which the numerals appear in their natural
order on reading the pattern as one would a page of a book.
The proof is based on the fact (F) that there exist two numerals
which are in the same row in the pattern $P'$ and in the same
column in the pattern $P$. If $v$ is their transposition it belongs
to the group $(p')$ associated with the rows of $P'$ and at the same
time to the group $(q)$ associated with the columns of $P$; hence

$$c'(sv) = c'(s), \qquad c(vs) = -c(s).$$

On replacing $vt$ in

$$c'c(s) = \sum_t c'(st^{-1})\,c(t) = -\sum_t c'(st^{-1})\,c(vt) \tag{14.10}$$

by $t$ alone we find

$$c'c(s) = -\sum_t c'(st^{-1}v)\,c(t) = -\sum_t c'(st^{-1})\,c(t) = -c'c(s). \tag{14.11}$$
(F) is evident if the first row of $P'$ is already longer than the
first row of $P$, for it is impossible to distribute the $f_1'$ numerals
in the first row of $P'$ over different columns of $P$ if $f_1 < f_1'$.
If $f_1' = f_1$ and the numerals of the first row of $P'$ are actually
distributed over different columns of $P$, we discard the first
row of $P'$ and the $f_1$ fields of $P$ containing the same numerals as
this row. On shifting the fields of $P$ upward to fill in the gaps
$P$ is transformed into a pattern which has exactly the same
appearance as if we discarded the first row of $P$; we are only
interested in the fact that this process leaves all pieces in their
original column. The proof can then be completed by mathematical
induction, by assuming that it holds for the abbreviated
patterns obtained by omitting the first rows of $P$ and $P'$.
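
The vanishing of $c'c$ for a higher pattern $P'$ can be observed directly on small examples. The sketch below uses the normal arrangement of the numerals (here $0, \cdots, f-1$ in reading order), the product convention $(xy)(s) = \sum_{tt'=s} x(t)y(t')$, and illustrative function names; it is a check on particular cases, not a substitute for the proof.

```python
from itertools import permutations

def compose(s, t):
    """Product st of permutations (tuples of images): apply t first, then s."""
    return tuple(s[i] for i in t)

def sign(s):
    n = len(s)
    return -1 if sum(s[i] > s[j] for i in range(n) for j in range(i + 1, n)) % 2 else 1

def stabilizer(blocks, f):
    block_of = {i: k for k, block in enumerate(blocks) for i in block}
    return [s for s in permutations(range(f))
            if all(block_of[s[i]] == block_of[i] for i in range(f))]

def young_operator(pattern):
    """c = ba: components delta_q on the products qp, zero elsewhere."""
    f = sum(pattern)
    rows, start = [], 0
    for length in pattern:
        rows.append(list(range(start, start + length)))
        start += length
    columns = [[r[j] for r in rows if j < len(r)] for j in range(pattern[0])]
    return {compose(q, p): sign(q)
            for q in stabilizer(columns, f) for p in stabilizer(rows, f)}

def multiply(x, y):
    """Product in the group algebra; vanishing components are dropped."""
    out = {}
    for t, a in x.items():
        for u, b in y.items():
            s = compose(t, u)
            out[s] = out.get(s, 0) + a * b
    return {s: v for s, v in out.items() if v != 0}

def is_higher(p_prime, p):
    """True if the first non-vanishing difference f'_k - f_k is positive."""
    width = max(len(p_prime), len(p))
    a = list(p_prime) + [0] * (width - len(p_prime))
    b = list(p) + [0] * (width - len(p))
    for x, y in zip(a, b):
        if x != y:
            return x > y
    return False
```

For instance `multiply(young_operator((3, 1)), young_operator((2, 2)))` is the empty dictionary, i.e. the zero element, whereas the product of an operator with itself is not.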

Theorem (14.12). Let $c, c', \cdots$ be the Young symmetry
operators associated with different patterns $P, P', \cdots$; the
corresponding sub-spaces $\mathfrak{p}, \mathfrak{p}', \cdots$ are then linearly
independent.

Let the $P, P', P'', \cdots$ be arranged in such an order that
$P$ is higher than $P'$, $P'$ higher than $P''$, $\cdots$. An element $x$ of
$\mathfrak{p} = \mathfrak{p}_c$ is reproduced by right-multiplication with $c/\gamma$ but, by
the previous theorem, this process transforms all elements
$x'$ of $\mathfrak{p}'$, $x''$ of $\mathfrak{p}''$, $\cdots$ into 0. Assume there exists such a
linear dependence

$$x + x' + x'' + \cdots = 0;$$

on right-multiplication with $c$ we find $x = 0$ and consequently
$x' + x'' + \cdots = 0$. The theorem is thus reduced to the
same theorem for the smaller set $P', P'', \cdots$, and the proof
follows by mathematical induction.

Theorem (14.13). Different patterns $P$, $P'$ give rise to
inequivalent sub-spaces $\mathfrak{p}_c$, $\mathfrak{p}_{c'}$.

The proof is accomplished by a direct derivation of the
orthogonality relations. Let $P'$ be higher than $P$. Since we
did not assume in proving theorem (14.9) that the numerals
were distributed in the same order in the two patterns $P$ and $P'$,
we may replace the element $c$ with components $c(s)$ by the
"conjugate" element $c_{r^{-1}}$ with components $c(rsr^{-1})$:

$$\sum_t c'(st^{-1})\,c(rtr^{-1}) = 0.$$

Summation with respect to $r$ yields

$$\sum_t c'(st^{-1}) \cdot \chi_c(t) = 0.$$

On writing $\chi = \chi_c$, $\chi' = \chi_{c'}$ this formula is equivalent to

$$\sum_t \chi'(st^{-1})\,\chi(t) = 0.$$

In particular

$$\sum_t \chi'(t^{-1})\,\chi(t) = 0.$$

If the two sub-spaces were equivalent we would have $\chi'(t) = \chi(t)$,
and since $\chi(t^{-1}) = \chi(t)$ in the symmetric group the above
equation would yield

$$\sum_s \chi^2(s) = 0.$$

But this is impossible, for by (14.7) the character $\chi(s)$ has
rational components, and in particular $\chi(1) = g \ne 0$.

This last conclusion is valid only if the number field in which
we operate is non-modular; naturally this restriction is irrelevant
for physics. Nevertheless it constitutes a blemish which should
be removed, for the remainder of our deductions only introduce
the minimum assumption that $f!$ is not 0 in the field under
consideration. Now from the general theory we know that

Theorem (14.14). $\sum_s \chi(s)\,\chi(s^{-1}) = f!$.

The blemish mentioned above is removed by proving this
theorem directly. We must show that

$$\sum_s \chi(s^{-1}) \cdot e(s) = 1$$

or

$$\sum_{r,s} e(rs^{-1}r^{-1})\,e(s) = 1.$$

On replacing the summation variable $s$ by $sr$, where $r$ is fixed,
this becomes

$$\sum_{r,s} e(sr)\,e(s^{-1}r^{-1}) = 1. \tag{14.15}$$

Consider next the function

$$a(s, s') = \sum_r e(sr)\,e(s'r^{-1});$$

as a function of $s$ it satisfies the second condition in (14.3).
But the first of these conditions is also satisfied, as can be seen
immediately by replacing $r$ in

$$a(sp, s') = \sum_r e(spr)\,e(s'r^{-1})$$

by the summation variable $p^{-1}r$. Hence by (14.2)

$$a(s, s') = c(s) \cdot \sum_r e(r)\,e(s'r^{-1}) = c(s) \cdot e(s') = \frac{1}{\gamma}\,c(s)\,c(s')$$

and therefore the left-hand side of (14.15) or

$$\sum_s a(s, s^{-1}) = \frac{1}{\gamma} \sum_s c(s)\,c(s^{-1})$$

is actually equal to 1.

The relations

$$\sum_t \chi(t)\,\chi'(t^{-1}) = 0 \text{ or } f! \qquad (14.16)$$

show that the primitive characters obtained by our construction
from the various symmetry patterns are linearly independent,
and since their number is equal to the number of classes of
conjugates in the group $\pi$, any class function can be represented
as a linear combination of the $\chi(s)$. In particular, the function
1(s), which is 1 for s = 1 and otherwise 0, must possess such
an expansion:

$$f! \cdot 1(s) = m\chi(s) + m'\chi'(s) + \cdots. \qquad (14.17)$$

Multiplying by $\chi(s^{-1})$ and summing over s we obtain, with the
aid of the orthogonality relations (14.16), the equation

$$f! \cdot \chi(1) = f! \cdot m$$

or

$$m = g \qquad (14.18)$$

for m. Since

$$\chi(s) = \sum_r e(rsr^{-1}) = \sum_r e_r(s),$$

equation (14.17) gives the reduction of the modulus 1 into
primitive idempotent elements $e_r$. Hence the regular
representation is reduced into the irreducible representations
associated with the various symmetry patterns. Since $f! \cdot 1(s)$
is the character of the regular representation, eq. (14.18) is a
direct verification of the fact, proved in the general theory, that
the number of times each irreducible representation appears
in the regular representation is equal to its dimensionality.
This completes our direct and elementary development of the
theory of the representations of the symmetric group.
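
The orthogonality relations (14.16) and the expansion (14.17) can be checked by brute force in the smallest non-trivial case, f = 3. The sketch below is only an illustration: the three characters are written down from the standard character table of the symmetric group on three ciphers rather than built from Young operators, and all function names are chosen here for convenience.

```python
from itertools import permutations

def inverse(a):
    """Inverse of a permutation given as a tuple of images."""
    inv = [0] * len(a)
    for i, ai in enumerate(a):
        inv[ai] = i
    return tuple(inv)

def sign(a):
    """Parity of a permutation, computed from its inversion count."""
    inv = sum(1 for i in range(len(a)) for j in range(i + 1, len(a)) if a[i] > a[j])
    return -1 if inv % 2 else 1

def fixed_points(a):
    return sum(1 for i, ai in enumerate(a) if i == ai)

f = 3
group = list(permutations(range(f)))
identity = tuple(range(f))

# Assumed input: the three primitive characters for f = 3.
characters = {
    "symmetric": lambda s: 1,
    "mixed": lambda s: fixed_points(s) - 1,
    "anti-symmetric": sign,
}

def pairing(chi, chi_prime):
    """Sum over t of chi(t) * chi'(t^-1), the left side of (14.16)."""
    return sum(chi(t) * chi_prime(inverse(t)) for t in group)

gram = {(a, b): pairing(ca, cb)
        for a, ca in characters.items()
        for b, cb in characters.items()}

def regular_character(s):
    """Right side of (14.17) with each m set to chi(1) = g, as in (14.18)."""
    return sum(chi(identity) * chi(s) for chi in characters.values())
```

Every diagonal entry of `gram` should come out as f! = 6 and every off-diagonal entry as 0, while `regular_character` should equal 6 at the unit element and 0 elsewhere.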

The method of proof employed in establishing theorem (14.9),
i.e. that cc' = 0 if P' is lower than P, will now be used to answer
another question. Let a be the operator, introduced in the
previous section, which symmetrizes with respect to the ciphers
occupying the rows of P:

a(s) = 1 or 0 according as s belongs to (p) or not,

and let the numerals be written in the pattern P', which is
lower than P, in an arbitrary order. I assert that ac' = 0.
There exist two numerals which occupy the same row in P
and the same column in P'. If v is the transposition of these
two numerals then

$$a(sv) = a(s), \qquad c'(vs) = -c'(s),$$

and the assertion is proved with the aid of (14.10), (14.11) on
replacing c', c there by a, c'. Hence also

$$\sum_t a(st^{-1})\,c'(rtr^{-1}) = 0,$$

$$\sum_t a(st^{-1})\,\chi'(t) = 0 \quad\text{or}\quad \sum_r a(r^{-1})\,\chi'(rs) = 0.$$

That is, the sum of the $\chi'(t)$ extended over all elements t = rs
which are left-equivalent to s mod. (p) [i.e. r in (p)], is zero.
In particular, $\sum_s \chi'(s) = 0$, where the sum is extended over all
elements s of (p); $\chi'$ is the character associated with a pattern
P' which is lower than P. On applying this result to the
considerations of § 8 (in particular, to (8.13) ff.) we find:

If the individual I has the simple energy levels $E_1, E_2, \cdots$, the
term

$$f_1E_1 + f_2E_2 + \cdots \qquad (f_1 \geq f_2 \geq \cdots,\quad f_1 + f_2 + \cdots = f)$$

of the unperturbed system $I^f$ appears only in those symmetry
classes of tensors whose pattern $P' = P(f_1', f_2', \cdots)$ is not lower
than $P = P(f_1, f_2, \cdots)$.

Thus we saw in discussing the two-electron problem that
terms of the form $E_i + E_j$ appeared in the “anti-symmetric”
as well as the “symmetric” term systems, whereas terms such
as $2E_i$ appeared only in the latter.

Finally, we consider the relations existing between two
dual patterns P and P* with generators c, c* and characters
$\chi$, $\chi^*$. The group (p) which permutes the members of each
row of P among themselves coincides with the group (q*) which
permutes the members in each column of P* among themselves;
similarly (q) = (p*). If s = qp is in (qp), then $s^{-1} = p^{-1}q^{-1} = q^*p^*$
is in (q*p*), and conversely; for such an element

$$c(s) = \delta_q, \qquad c^*(s^{-1}) = \delta_{q^*} = \delta_p.$$

Hence in general, even when s is not in (qp) and, consequently,
$s^{-1}$ is not in (q*p*), we have

$$c^*(s^{-1}) = \delta_s \cdot c(s).$$

“Dual” elements c, c* are therefore related to each other in
exactly the same way as the “duals” introduced in § 12.
Further

$$\gamma^* = \gamma; \qquad \chi^*(s) = \delta_s \cdot \chi(s); \qquad g^* = g.$$

If P is higher than Q, then conversely P* is lower than Q*. 
For if we lower P by taking away the last field of one of the 
rows of P and adding it to the end of a later (shorter) row, one 
of the columns of P is increased at the expense of a later (shorter) 
column ; by such a process of shifting individual fields, in which 
no gap is to occur in a row or a column, P can be transformed 
into the lower pattern Q. 

§ 15. Spin and Valence. Group-theoretic Classification 
of Atomic Spectra 

If the vector space $\mathfrak{R} = \mathfrak{R}_2$ is only 2-dimensional, the only
symmetry patterns P which give rise to primitive symmetry
classes of tensors of order f are those which consist of at most
two rows. Let the first row contain l + v fields and the second
l; then

$$v = f - 2l.$$

The symmetry pattern P is thus uniquely characterized by the
number v, which we call its valence, and v may assume any of
the values f, f − 2, f − 4, ···. Let $\mathfrak{P}_v$ be the totality of tensors
of the form cF obtained by applying the Young symmetry
operator c associated with the pattern P to the totality of tensors
F, and let $\mathfrak{H}_v$ be the representation of the linear group, the
substratum of which is the tensor manifold $\mathfrak{P}_v$. A sufficiently
general tensor of order f which is symmetric in the first as well
as the second rows of indices is given by

$$\mathfrak{x} \times \mathfrak{x} \times \cdots \times \mathfrak{x} \times \mathfrak{x} \qquad (l + v \text{ terms})$$

$$\times\; \mathfrak{y} \times \mathfrak{y} \times \cdots \times \mathfrak{y} \qquad (l \text{ terms}),$$

where

$$\mathfrak{x} = (x_1, x_2), \qquad \mathfrak{y} = (y_1, y_2)$$

are two arbitrary vectors. On alternating with respect to the
columns we find that the representation $\mathfrak{H}_v$ of the linear group
$\mathfrak{c} = \mathfrak{c}_2$ is that one which is induced on the quantities

$$(x_1y_2 - x_2y_1)^l \cdot x_1^{r_1} x_2^{r_2} \qquad (r_1 + r_2 = v).$$

Hence $\mathfrak{H}_v$ is the representation of the linear group which was
denoted in III, § 5, by

This remark supplies the connection with the symmetry
problem of quantum mechanics as dealt with in § 12, on applying
the Pauli exclusion principle when the existence of the spin,
but not its dynamical effect, is taken into account. Since
the spin space is 2-dimensional, formula (12.10) tells us that
the only patterns P which give rise to a term system are those
whose duals P* consist of at most two rows, i.e. those P which
themselves have but two columns. If v is now the number of
fields by which the first column of P exceeds the second we call
v the valence of the term system or of the corresponding state of
the atom. The multiplicity of the term system with valence
v is v + 1, and to each of these possible multiplicities corresponds
but one term system, as we have already seen in § 12
(in particular p. 356). We previously (Chap. IV) called s = v/2
the “spin quantum number.”

The fact that the longest column of P cannot exceed the
dimensionality N of the vector space $\mathfrak{R}_t$ associated with the
electron translation may result in a further restriction on the
possible symmetry patterns P. This situation cannot arise
as long as we deal with the total ∞-dimensional system space.
On the other hand if we restrict ourselves, for example, to those
states of the electron which are characterized by a fixed principal
quantum number n and a fixed azimuthal quantum number l,
and which therefore constitute a (2l + 1)-dimensional sub-space
$\mathfrak{R}(nl)$ within $\mathfrak{R}_t$, i.e. if we consider only those states of
the atom in which all the f electrons outside a closed core are
in $\mathfrak{R}(nl)$, the dimensionality N is reduced to 2l + 1. Then f
cannot exceed 2(2l + 1) and the possible valences of the states
under consideration are given by the following table:


f =   1   2   3   4   ···   4l   4l+1   4l+2

v =   1   0   1   0   ···   0    1      0
          2   3   2   ···   2
                  4

This table again gives us the alternation law, but shows that in
addition the number of possibilities decreases from the middle
of the table on. The possible multiplet numbers 2s + 1 of
terms in these states are one greater than v.
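
The table can be reproduced mechanically from the two constraints stated above: the pattern has at most two columns, and the longer column may not exceed N = 2l + 1 fields. The following is a minimal sketch under that reading; the function name and the convention of returning an empty list for impossible electron numbers are choices made here, not part of the text.

```python
def possible_valences(f, l):
    """Valences v for f electrons confined to a (2l+1)-dimensional shell.

    The pattern has two columns of lengths a >= b with a + b = f, the
    longer column may not exceed N = 2l + 1, and the valence is v = a - b.
    """
    N = 2 * l + 1
    if not 0 <= f <= 2 * N:
        return []                     # more than 2(2l+1) electrons cannot fit
    b_min = max(0, f - N)             # forces the first column to have at most N fields
    return sorted(f - 2 * b for b in range(b_min, f // 2 + 1))

# Columns of the table for l = 2 (a d-shell), f = 1, ..., 4l + 2.
d_shell = {f: possible_valences(f, 2) for f in range(1, 11)}
```

For l = 2 the entries rise to three possible valences around the middle of the shell and fall back symmetrically, which is the decrease "from the middle of the table on" noted above.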

This “valence” v, which describes the symmetry state of
the system, is actually the chemical valence, as was shown by
F. London. We allow two atoms, consisting of $f_1$, $f_2$ electrons
respectively, to come together to form a molecule with $f = f_1 + f_2$
electrons. Let $\mathfrak{P}_1$, $\mathfrak{P}_2$ be irreducible invariant sub-spaces of
the system spaces of the two atoms respectively. In order to find which
symmetry states the molecule is capable of assuming when the
first atom is in the state $\mathfrak{P}_1$ and the second in $\mathfrak{P}_2$ we must
completely reduce the space $\mathfrak{P}_1 \times \mathfrak{P}_2$ into irreducible constituents.
If we consider this decomposition as taking place in the vector
space of electron spin rather than in that of electron translation
(the justification for which will be given below), the
problem is solved by the Clebsch-Gordan series (III, 5.9); it
tells us that if the valences of the symmetry states of the two
atoms are $v_1$, $v_2$, the resulting symmetry states of the molecule
are those with valences

$$v = v_1 + v_2,\; v_1 + v_2 - 2,\; v_1 + v_2 - 4,\; \cdots,\; |v_1 - v_2|. \qquad (15.1)$$

This situation can be readily visualized in terms of the symmetry
patterns as follows. Bring the two symmetry patterns $P_1$, $P_2$
of the two atoms into the positions shown in the accompanying
diagram and then shove $P_2$ vertically upwards, one field at a
time, until one of the two columns of the combined pattern is
closed; each of these steps represents a possible symmetry
pattern for the molecule, in which v is the number of fields
which are not paired horizontally. The saturation of the valence
bonds here appears as the pairing of fields or, more physically,
as the saturation of the spin of an electron in one of the atoms
with that of an electron in the other. The empirical theory of
the valence bond has therefore a rather profound significance.
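
The series (15.1) is easy to enumerate. The one-line sketch below simply lists the valences the molecule can assume for given atomic valences; the function name is an illustrative choice.

```python
def molecule_valences(v1, v2):
    """Valences allowed by (15.1): v1 + v2, v1 + v2 - 2, ..., |v1 - v2|."""
    return list(range(v1 + v2, abs(v1 - v2) - 1, -2))

# Two univalent atoms: either both valences remain free (v = 2)
# or they saturate each other (v = 0).
two_univalent = molecule_valences(1, 1)
```

Each step down the list corresponds to one further pair of fields being joined horizontally in the picture described above.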

We have yet to justify our use of spin space
rather than translation space in the above. Consider the
representation of the permutation group $\pi_f$ corresponding to the
two-columned symmetry pattern of valence v; its
dual consists of but two rows. The Clebsch-Gordan series,
together with the third reciprocity theorem of § 10 as applied to
the linear group $\mathfrak{c} = \mathfrak{c}_2$, tells us that on restricting $\pi$ to the
sub-group $\pi' = \pi_{f_1} \times \pi_{f_2}$ which permutes the electrons of each atom
separately, the two-rowed representation of $\pi$ with valence v
contains the product of the two-rowed representations of $\pi'$ with
valences $v_1$, $v_2$ once or not at all, according as
v is one of the values (15.1) or not. From this it follows
immediately that the same result holds for the duals, i.e. the
two-columned representations, on reducing
after restricting $\pi$ to $\pi'$. Applying the same reciprocity
theorem in the opposite direction for the case in which $\mathfrak{c} = \mathfrak{c}_n$
is the linear group in n dimensions, we find that the
representation $\mathfrak{H}_{v_1} \times \mathfrak{H}_{v_2}$ of $\mathfrak{c}$ (or the algebra $\Sigma$) contains the
representation $\mathfrak{H}_v$
once or not at all according as v is one of the values (15.1) or
not. On reducing $\mathfrak{H}_{v_1} \times \mathfrak{H}_{v_2}$ into its irreducible constituents
we may expect to find other representations, which may even
occur more than once, in addition to these simple $\mathfrak{H}_v$, but these
additional representations will correspond to symmetry patterns
with more than two columns and are, in virtue of the Pauli
exclusion principle, of no importance for physics. The number
b introduced in § 11 is accordingly at most equal to 1 in the case
of diatomic molecules.

Molecules which consist of a larger number of atoms can
be studied by the same method. If in particular we are
interested in the case of three atoms and their valences are $v_1$, $v_2$, $v_3$,
we can determine with the aid of the Clebsch-Gordan series
the number of times $b_v$ the representation $\mathfrak{H}_v$ occurs in the
reduction of $\mathfrak{H}_{v_1} \times \mathfrak{H}_{v_2} \times \mathfrak{H}_{v_3}$. Those v for which $b_v \neq 0$ are
the valences of the possible symmetry states of the molecule
and $b = b_v$ (which may here be greater than 1) are the
corresponding multiplicities. The characterization of the quantum
and symmetry states of a molecule which is formed by the
union of three atoms in given quantum and symmetry states
requires, in addition to the valence v, a further index which
distinguishes between the various $b_v$ possible energy levels.
But this description of the various possibilities differs from the
empirical theory of the valence bond: the manifold of possible
bindings is smaller.
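
For several atoms the multiplicities $b_v$ follow from applying the Clebsch-Gordan series repeatedly, one atom at a time. The sketch below does this with a simple counter; the helper names are invented for the illustration, and the computation takes place entirely in spin space, as the argument above permits.

```python
from collections import Counter

def couple(multiplicities, v_next):
    """Adjoin one more atom of valence v_next to an existing valence count."""
    out = Counter()
    for v, b in multiplicities.items():
        # Clebsch-Gordan series (15.1) applied to the pair (v, v_next).
        for w in range(v + v_next, abs(v - v_next) - 1, -2):
            out[w] += b
    return out

def molecule_multiplicities(valences):
    """Return {v: b_v} for a molecule built from atoms with the given valences."""
    counts = Counter({valences[0]: 1})
    for v in valences[1:]:
        counts = couple(counts, v)
    return dict(counts)

three_univalent = molecule_multiplicities([1, 1, 1])
```

For three univalent atoms the valence v = 1 occurs twice, which is the simplest instance of $b_v$ exceeding 1. A useful consistency check is that the sum of $b_v (v + 1)$ equals the product of the $(v_i + 1)$.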

Classification of Spectral Terms.

Let the unitary or the complete linear group in the system
space $\mathfrak{R}$ of the single electron be restricted to the group $\mathfrak{c}_\nu \times \mathfrak{c}_n$
of transformations $s_\nu \times s_n$, the two factors of which are
transformations of the spin and translation spaces respectively:
$\mathfrak{R} = \mathfrak{R}_s \times \mathfrak{R}_t$. The space $\{\mathfrak{R}^f\}$ of anti-symmetric tensors of
order f is then reducible into irreducible invariant sub-spaces
with respect to the algebra of symmetric transformations of
the form (12.2). We thus obtain a distribution (I) of spectral
terms among the various symmetry classes; this step is of
universal validity and is applicable to molecules as well as
atoms.

The further classification of terms, as discussed in Chapter IV,
A, refers to “simple” rather than “quantum” states, i.e. to
those states which are related to spatial rotation and moment
of momentum in the same way that the quantum states are
related to displacement in time and energy. Naturally this
application of the rotation group $\mathfrak{d} = \mathfrak{d}_3$ (the elements of which
we now denote by $\sigma, \tau, \cdots$) is significant only for atoms (or
ions), the nuclei of which are considered as fixed centres of
force. So long as we concern ourselves only with the electron
translation and neglect the mutual perturbations of the electrons,
which are characterized by principal and azimuthal quantum
numbers n and l, each individual term of the system is
characterized by the quantum numbers $(n_1 l_1;\, n_2 l_2;\, \cdots;\, n_f l_f)$.

The number of times such a term appears in a given symmetry
system is equal to the dimensionality of the linear sub-space
in which the atomic states under consideration lie. The
resolution caused by the mutual perturbations parallels the reduction
of this sub-space into its irreducible constituents $\mathfrak{D}_L$ with respect
to the group $\mathfrak{d}$ of rotations; the resulting components of the
term have the natural multiplicities 2L + 1. The spin space is
similarly to be reduced. Let $\mathfrak{d}$ induce the representations
$\sigma \to U(\sigma)$ and $\tau \to V(\tau)$ in $\mathfrak{R}_s$ and $\mathfrak{R}_t$ respectively. This
second step (II), in which the spin and translation spaces are
considered separately, is interpreted from the stand-point of group
theory as meaning that we associate with the element $(\sigma, \tau)$
of $\mathfrak{d} \times \mathfrak{d}$ the transformation $U(\sigma) \times V(\tau)$; we thus obtain a
6-parameter sub-group of $\mathfrak{c}_\nu \times \mathfrak{c}_n$, and on restricting $\mathfrak{c}_\nu \times \mathfrak{c}_n$ to
this sub-group our original irreducible sub-space is further
completely reducible into irreducible constituents. The
irreducible representation of $\mathfrak{d} \times \mathfrak{d}$ induced in such a sub-space is
of the type $\mathfrak{D}_s \times \mathfrak{D}_L$. The final step, (III), consists in introducing
the coupling $\sigma = \tau$; the 6-parameter sub-group is thereby
restricted to a 3-parameter sub-group, i.e. that sub-group
induced in the total system space by the rotations $\mathfrak{d}$. The
spin perturbation then resolves each such term multiplet into
its (at most 2s + 1) components:

$$\mathfrak{D}_s \times \mathfrak{D}_L = \sum_j \mathfrak{D}_j \qquad (j = L + s,\; L + s - 1,\; \cdots,\; |L - s|);$$

naturally $\mathfrak{D}_s \times \mathfrak{D}_L$ is here a representation of $\mathfrak{d}$ instead of $\mathfrak{d} \times \mathfrak{d}$.

Actually $\nu = 2$, and the transformations induced in the
spin space $\mathfrak{R}_s$ by the rotation group constitute the unitary group
in two dimensions. Consequently the transition from $\mathfrak{c}_\nu$ to $\mathfrak{d}$
in step (II) involves no reduction in spin space; this is the
essential simplification caused by the fact that $\mathfrak{R}_s$ has so small
a dimensionality.

To the symmetry system of terms corresponds a certain
irreducible representation of the unitary group $\mathfrak{u}$ in the space
of the electron translation and with it a certain irreducible
characteristic (§ 9)

$$X = X(\varepsilon_1, \varepsilon_2, \cdots).$$

The co-ordinates $x_i$ in the space $\mathfrak{R}_t$ are broken up into classes
in the manner described in Chapter IV, § 1:

$$x(m) \quad [m = l,\, l - 1,\, \cdots,\, -l];$$

$$x'(m') \quad [m' = l',\, l' - 1,\, \cdots,\, -l']; \quad \cdots.$$

Each of these classes describes a (2l + 1)-dimensional sub-space
$\mathfrak{R}(nl)$ of $\mathfrak{R}_t$ in which the group $\mathfrak{d}_3$ of spatial rotations induces
the irreducible representation $\mathfrak{D}_l$ and is characterized by the
principal quantum number n and the azimuthal quantum number
l. The arguments $\varepsilon_i$ of X are correspondingly broken up into
classes. To give the principal and azimuthal quantum numbers
of the individual electrons, without stating how these numbers
are distributed among the f electrons, we need only to state
how many (f') electrons are represented by states in each of
the various sub-spaces $\mathfrak{R}' = \mathfrak{R}(nl)$. If, for example, 3 of the
electrons are in $\mathfrak{R}'$ and the remaining 5 in $\mathfrak{R}''$ (f = 8) we must
separate out that part of X which is of degree 3 in the variables
$\varepsilon_i$ belonging to $\mathfrak{R}'$ and of degree 5 in those belonging to $\mathfrak{R}''$.
The multiplicity M of the corresponding term

$$E(n_1 l_1) + E(n_2 l_2) + \cdots + E(n_f l_f)$$

of the “unperturbed” atom in the symmetry system under
consideration is then obtained from the part of X described
above by setting all $\varepsilon$ contained in it equal to unity. In order
to determine how this M-fold term is broken up on taking the
mutual influence of the electrons into account we replace the
variables $\varepsilon(m)$ of the class $\mathfrak{R}(nl)$ by $\varepsilon(m) = \varepsilon^m$, the variables
$\varepsilon'(m')$ of the class $\mathfrak{R}(n'l')$ by $\varepsilon'(m') = \varepsilon^{m'}$ (with the same $\varepsilon$), etc.
The resulting expression must be a linear combination of the
sums

$$\sum_{m=-L}^{+L} \varepsilon^m = \frac{\varepsilon^{L+1} - \varepsilon^{-L}}{\varepsilon - 1}$$

with non-negative integral coefficients. This enables us to
tell which of the various total azimuthal quantum numbers L
appear, and how often, in the resolution of the above term;
each such L-term has still the multiplicity 2L + 1.

Example. We consider, as an example, the case in which
f = 3 and all three electrons are in the same sub-space $\mathfrak{R}(nl)$.
The possible symmetry patterns are

[Diagrams of the three symmetry patterns: a single column of three fields; a column of two fields with a third field to the right of the first; a single row of three fields.]

The Pauli exclusion principle allows only the first two; their
valences are v = 3 and v = 1, and the corresponding terms are
therefore quadruplets and doublets, respectively. The first
pattern defines the anti-symmetric tensors of order 3 and the
third the symmetric tensors. The corresponding characteristics
are therefore

$$X_1 = s_1 = \sum_{i<j<k} \varepsilon_i\varepsilon_j\varepsilon_k, \qquad X_3 = \sum_{i \leq j \leq k} \varepsilon_i\varepsilon_j\varepsilon_k.$$

On introducing

$$s_2 = \sum_{i \neq k} \varepsilon_i^2\,\varepsilon_k, \qquad s_3 = \sum_i \varepsilon_i^3$$

we have $X_3 = s_1 + s_2 + s_3$. The dimensionalities of the
representations of $\pi_3$ corresponding to these three patterns, and
therefore the numbers of times the representations $X_1$, $X_2$, $X_3$
of $\mathfrak{c}$ appear in $(\mathfrak{c})^3$, are easily shown to be 1, 2, 1 (in accordance
with the equation $3! = 1^2 + 2^2 + 1^2$). Now the characteristic
of the representation $(\mathfrak{c})^3$ of $\mathfrak{c}$ is

$$t_1^3 = \Big(\sum_i \varepsilon_i\Big)^3 = s_3 + 3s_2 + 6s_1; \qquad (15.2)$$

the equation

$$t_1^3 = X_1 + 2X_2 + X_3 = (2s_1 + s_2 + s_3) + 2X_2$$

then allows us to conclude that

$$X_2 = s_2 + 2s_1.$$

We prefer to carry out the evaluation with the aid of the sums
of powers

$$t_1 = \sum_i \varepsilon_i, \qquad t_2 = \sum_i \varepsilon_i^2, \qquad t_3 = \sum_i \varepsilon_i^3;$$

we then have

$$t_1 t_2 = s_3 + s_2, \qquad t_3 = s_3$$

in addition to (15.2). Consequently the characteristics in
which we are interested are:

Doublets: $X_2 = \tfrac{1}{3}(t_1^3 - t_3)$, (15.3)

Quadruplets: $X_1 = \tfrac{1}{6}(t_1^3 - t_3) - \tfrac{1}{2}(t_1 t_2 - t_3)$. (15.4)

The solution of the problem discussed above is now obtained
by replacing the 2l + 1 variables $\varepsilon_i$ by the set

$$\varepsilon^l,\; \varepsilon^{l-1},\; \cdots,\; \varepsilon^{-l}$$

and then expressing $t_1^3$, $t_1 t_2$, $t_3$ as a sum $\sum_L a_L\,(L)$ of expressions of
the form

$$(L):\quad \varepsilon^L + \varepsilon^{L-1} + \cdots + \varepsilon^{-L}$$

with integral multiplicities $a_L$. The computation is considerably
simplified by multiplying both sides of the equation by $\varepsilon - 1$,
as (L) then becomes $\varepsilon^{L+1} - \varepsilon^{-L}$. The multiplicities so obtained
are given in the following tables:


t_1^3:
    L =            3l,  3l−1,  3l−2,  ···,  l     |   L = 0,  1,  2,  ···
    Multiplicity:  1,   2,     3,     ···         |       1,  3,  5,  ···
                   (increasing by 1 each step)    |       (increasing by 2 each step)

t_1 t_2:
    L =            3l,  3l−1,  3l−2,  3l−3,  ···,  l
    Multiplicity:  1,   0,     1,     0,     ···,  1
                   (alternately 1 and 0)

    L =            l,   l−1,   l−2,   l−3,   ···,  0
    Multiplicity:  1,   −1,    1,     −1,    ···
                   (alternately 1 and −1)

t_3:
    L =            3l,  3l−1,  3l−2,  3l−3,  3l−4,  3l−5,  ···
    Multiplicity:  1,   −1,    0,     1,     −1,    0,     ···
                   (repetition with period 3)



On applying these results to the computation of $X_1$, $X_2$ with the
aid of (15.3) and (15.4) we find that the number of terms with
total azimuthal quantum number L is as given in the following
tables:

Doublet System.

(1)  L =  0,  1,  2,  |  3,  4,  5,  |  ···
          0   1   2   |  2   3   4   |  ···

up to L = l. The period is here 3; the multiplicities in the
second period are those of the first increased by 2, those in the
third are obtained from those in the second by adding 2, etc.

(2)  L =  3l,  3l − 1,  3l − 2,  |  3l − 3,  3l − 4,  3l − 5,  |  ···
          0    1        1        |  1        2        2        |  ···

down to L = l. The periodicity is again 3, but the multiplicities
in each period are obtained from those in the previous one by
adding 1 instead of 2.

Quadruplet System. The periodicity is here 6 instead of 3.
(1) For the values of L from 0 to l the first period of
multiplicities (L = 0, 1, 2, 3, 4, 5) is for even l: 0 1 0 2 1 2 and for
odd l: 1 0 1 1 2 1. The multiplicities increase by 2 from period
to period.
(2) For values of L from 3l down to l the first period is
0 0 0 1 0 1 regardless of whether l is odd or even, and the
multiplicities are increased by 1 from period to period.
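
The recipe of this example (substitute the powers of a single variable into the power sums, form the combinations (15.3) and (15.4), and peel off the sums (L) from the highest exponent downwards) lends itself to a direct computation. The following is a sketch under the assumption that polynomials in the single variable are stored as exponent-to-coefficient maps; all names are illustrative.

```python
from collections import Counter
from fractions import Fraction

def poly_mul(p, q):
    """Product of two Laurent polynomials stored as {exponent: coefficient}."""
    out = Counter()
    for a, ca in p.items():
        for b, cb in q.items():
            out[a + b] += ca * cb
    return out

def power_sum(l, k):
    """t_k after the substitution eps_i -> eps^m, m = -l, ..., l."""
    return Counter({k * m: 1 for m in range(-l, l + 1)})

def linear_combination(terms):
    """Sum of coef * poly with exact rational coefficients."""
    out = Counter()
    for coef, poly in terms:
        for e, c in poly.items():
            out[e] += Fraction(coef) * c
    return out

def l_multiplicities(poly):
    """Express poly as a sum of a_L * (L); return the non-zero a_L."""
    poly = dict(poly)
    result = {}
    top = max(poly) if poly else -1
    for L in range(top, -1, -1):
        a = poly.get(L, 0)
        if a:
            result[L] = int(a)
            for m in range(-L, L + 1):       # subtract a * (L)
                poly[m] = poly.get(m, 0) - a
    return result

def three_electron_terms(l):
    """Doublet and quadruplet L-multiplicities for three electrons in one shell."""
    t1, t2, t3 = (power_sum(l, k) for k in (1, 2, 3))
    t1_cubed = poly_mul(poly_mul(t1, t1), t1)
    t1t2 = poly_mul(t1, t2)
    doublets = linear_combination([(Fraction(1, 3), t1_cubed),
                                   (Fraction(-1, 3), t3)])        # (15.3)
    quadruplets = linear_combination([(Fraction(1, 6), t1_cubed),
                                      (Fraction(-1, 6), t3),
                                      (Fraction(-1, 2), t1t2),
                                      (Fraction(1, 2), t3)])      # (15.4)
    return l_multiplicities(doublets), l_multiplicities(quadruplets)

p_shell = three_electron_terms(1)
d_shell = three_electron_terms(2)
```

For l = 1 this gives doublets with L = 1, 2 and a single quadruplet with L = 0; for l = 2 the doublet with L = 2 appears twice, in agreement with the tables above.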


§ 16. Determination of the Primitive Characters of
$\mathfrak{u}$ and $\pi$

The guiding principle in the whole of the present chapter
is the reciprocity between the symmetric permutation group $\pi_f$
and the algebra $\Sigma$ of symmetric transformations. But this
latter can, as was shown in § 1, be replaced by the special
symmetric transformations induced in tensor space by the linear
transformations of vector space and which constitute a group
$(\mathfrak{c})^f$ isomorphic with the linear group $\mathfrak{c}$. Indeed, we may even
restrict $\mathfrak{c}$ to the unitary group $\mathfrak{u}$. The algebra $\Sigma$ is thereby
referred to a group: not to a finite group, it is true, but to a
closed continuous group. Now we have seen in Chapter III
that we may expect such groups to behave in a manner entirely
analogous to that met in dealing with finite groups, at least
if we concern ourselves only with unitary representations. As
a rule we find in mathematics that the continuum is more easily
handled than a discrete manifold; the formula (9.11), which
expresses the fundamental reciprocity mentioned above, will
therefore better serve to compute $\chi$ from X than the converse.

We therefore next evaluate the characteristics X of the
continuous irreducible unitary representations of the
n-dimensional unitary group $\mathfrak{u}$ by a direct method which is independent
of our previous development. The case n = 1 has already been
solved in III, § 8; the procedure there developed serves as
a model for the present case. With this in mind we first prove
the following auxiliary theorem:

A continuous function $f(\omega_1, \omega_2, \cdots, \omega_n)$ of absolute value
1 which possesses the period $2\pi$ in each of the n real arguments
and which satisfies the functional equation

$$f((\omega + \omega')) = f((\omega))\,f((\omega'))$$

is necessarily of the form

$$f((\omega)) = e(h_1\omega_1 + h_2\omega_2 + \cdots + h_n\omega_n)$$

where the constants h are integers.

On introducing the n functions

$$f_1(\omega) = f(\omega, 0, 0, \cdots, 0), \quad f_2(\omega) = f(0, \omega, 0, \cdots, 0), \quad \cdots$$

of one variable, we are able to conclude from the functional
equation above that

$$f(\omega_1, \omega_2, \cdots) = f_1(\omega_1)\,f_2(\omega_2) \cdots f_n(\omega_n).$$

It therefore suffices to prove the theorem for functions $f(\omega)$ of
one variable, and this we have already done [III, § 8].

Every element s of the group $\mathfrak{u}$ is conjugate to a “principal”
element E, i.e. to a transformation of the form

$$x_\nu \to \varepsilon_\nu x_\nu \qquad (\nu = 1, 2, \cdots, n). \qquad (16.1)$$

The numbers $\varepsilon_\nu$ are of unit modulus and may therefore be
expressed as

$$\varepsilon_\nu = e^{i\omega_\nu} = e(\omega_\nu)$$

in terms of the “angles of rotation” $\omega_1, \omega_2, \cdots, \omega_n$ (which are
only determined mod. $2\pi$) of the unitary transformation s.
In order to employ the orthogonality relations it is necessary
to determine the volume dS of that portion of the group
manifold $\mathfrak{u}$ whose elements have angles between $\omega_\nu$ and $\omega_\nu + d\omega_\nu$.
$a_1, a_2, \cdots, a_n$ being any n numbers, let $D(a_1, a_2, \cdots, a_n)$ denote
the product

$$\prod_{i<k} (a_i - a_k) = |\,a^{n-1}, \cdots, a, 1\,|$$

of differences; the n rows of the determinant on the right are
obtained by replacing a successively by $a_1, a_2, \cdots, a_n$. The
evaluation of the volume element dS will be carried out in the
following section; we here anticipate the result

$$dS = \Delta\bar{\Delta}\,d\omega_1\,d\omega_2 \cdots d\omega_n, \qquad \Delta = D(\varepsilon_1, \varepsilon_2, \cdots, \varepsilon_n). \qquad (16.2)$$

The determination of the primitive characteristics of $\mathfrak{u}$ is
accomplished by combining the following important facts.

1. Symmetry. Each element s of $\mathfrak{u}$ is conjugate to a
principal element (16.1). Hence it suffices to determine the
characteristic X of a continuous representation of $\mathfrak{u}$ for such
a principal element. E goes over into a conjugate
transformation within $\mathfrak{u}$ on permuting the $\varepsilon_\nu$: hence X is a continuous
symmetric function of the angles $\omega_\nu$ and is of period $2\pi$ in each
of them.

2. Arithmetic Properties. The principal elements constitute
an Abelian sub-group of $\mathfrak{u}$; on compounding two such elements
E, E' the angles $\omega_\nu$, $\omega_\nu'$ are added. The normal co-ordinates
$y_k$ in representation space $\mathfrak{R}$ can therefore be chosen in such a
way that the principal elements correspond to principal
transformations

$$E:\quad y_k \to \rho_k\,y_k;$$

indeed, we have shown in I, § 5, that any commutative system
of unitary correspondences can be brought simultaneously into
diagonal form. On compounding two principal elements the
condition that E be a representation is expressed by the functional
equation

$$\rho(\omega_1, \omega_2, \cdots)\,\rho(\omega_1', \omega_2', \cdots) = \rho(\omega_1 + \omega_1',\; \omega_2 + \omega_2',\; \cdots)$$

for each of the multipliers $\rho = \rho_k$. The auxiliary theorem then
tells us that each $\rho_k$ is of the form

$$e(h_1\omega_1 + \cdots + h_n\omega_n)$$

where the constants h are integers. The characteristic of the
representation is the sum of these; hence X is a finite Fourier
series in the arguments $\omega$ with integral non-negative coefficients.
The “weights” of a representation are the sets of exponents
$(h_1, h_2, \cdots, h_n)$ of each term

$$e(h_1\omega_1 + h_2\omega_2 + \cdots + h_n\omega_n) = \varepsilon_1^{h_1}\varepsilon_2^{h_2} \cdots \varepsilon_n^{h_n}$$

which actually appears in X. The term $(h_1, h_2, \cdots, h_n)$ is said
to be “higher” than $(h_1', h_2', \cdots, h_n')$ if the first non-vanishing
difference $h_1 - h_1'$, $h_2 - h_2'$, $\cdots$ is positive.

3. Orthogonality. For all primitive characteristics X the
integral

$$\int_0^{2\pi} \cdots \int_0^{2\pi} X\bar{X}\,\Delta\bar{\Delta}\,d\omega_1 \cdots d\omega_n$$

must have the value

$$V = \int_0^{2\pi} \cdots \int_0^{2\pi} \Delta\bar{\Delta}\,d\omega_1 \cdots d\omega_n. \qquad (16.3)$$

These orthogonality relations suggest that we introduce the
quantities $\xi = \Delta \cdot X$ in place of the characteristics X; they
are also finite Fourier series, but they are anti-symmetric functions
of the angles $\omega$ instead of symmetric ones. $h_1, h_2, \cdots, h_n$ being
integers arranged in decreasing order

$$h_1 > h_2 > \cdots > h_n, \qquad (16.4)$$

we construct the “elemental sum”

$$\xi(h_1, h_2, \cdots, h_n) = \sum \pm\, e(h_1\omega_1 + h_2\omega_2 + \cdots + h_n\omega_n), \qquad (16.5)$$

i.e. the alternating sum over the permutations of the arguments
$\omega$; the term which we have written down is the highest one
in the sum. Every alternating Fourier series is a linear aggregate
of such elemental sums; since the coefficients of these sums are
integers, and in particular that of the “highest” term is 1,
every alternating Fourier series, such as $\xi$, with integral
coefficients can be expressed as a linear aggregate of the form

$$\xi = c \cdot \xi(h_1, h_2, \cdots) + c' \cdot \xi(h_1', h_2', \cdots) + \cdots \qquad (16.6)$$

with integral coefficients $c, c', \cdots$. Let this expansion be
arranged in decreasing order, i.e. in such a way that the set
$(h_1, h_2, \cdots)$ of exponents is higher than $(h_1', h_2', \cdots)$, etc.;
$c \cdot e(h_1\omega_1 + h_2\omega_2 + \cdots)$ is then the highest term in $\xi$.
$\Delta$ is itself an elemental sum, namely

$$\Delta = \xi(n - 1,\; n - 2,\; \cdots,\; 1,\; 0).$$

Hence if the highest term in X has exponents $f_1, f_2, \cdots, f_n$ we have

$$h_1 = f_1 + (n - 1), \quad \cdots, \quad h_{n-1} = f_{n-1} + 1, \quad h_n = f_n; \qquad (16.7)$$

in the following the numbers $f_i$ and $h_i$ are always in the relation
(16.7) with one another.

We denote integration with respect to all the angles of
rotation from 0 to $2\pi$ by a single integral sign and write $d\omega$
for $d\omega_1\,d\omega_2 \cdots d\omega_n$. We now calculate

$$\int \xi(h_1, h_2, \cdots)\,\bar{\xi}(h_1', h_2', \cdots)\,d\omega;$$

the h and the h' are arranged in decreasing order in accordance
with (16.4). Consequently no permutation of the h can coincide
with a permutation of the h' unless

$$h_1 = h_1', \quad h_2 = h_2', \quad \cdots, \quad h_n = h_n'; \qquad (16.8)$$

the integral of each of the $(n!)^2$ terms in the product

$$\xi(h_1, h_2, \cdots)\,\bar{\xi}(h_1', h_2', \cdots)$$

is therefore 0 unless (16.8) holds. In this latter case those n!
terms, for which the permutation of the h is the same as that
of the h', each contribute $(2\pi)^n$ to the integral and all others
contribute 0; hence

$$\int \xi(h_1, h_2, \cdots)\,\bar{\xi}(h_1', h_2', \cdots)\,d\omega = n!\,(2\pi)^n \text{ or } 0$$

according as (16.8) holds or not. Applying this in particular
to the elemental sum $\Delta$, we find

$$\int \Delta\bar{\Delta}\,d\omega = V = n!\,(2\pi)^n.$$
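
The counting argument behind this integral can be mimicked exactly, without numerical integration, by recording each elemental sum as a table of exponent sets with coefficients ±1: distinct exponent sets integrate to zero against each other, and coinciding ones contribute $(2\pi)^n$. The sketch below is an illustration with names chosen here; it returns the integral divided by $(2\pi)^n$.

```python
from itertools import permutations

def perm_sign(p):
    """Parity of a permutation, computed from its inversion count."""
    inv = sum(1 for i in range(len(p)) for j in range(i + 1, len(p)) if p[i] > p[j])
    return -1 if inv % 2 else 1

def elemental_sum(h):
    """Map each exponent tuple occurring in xi(h) to its coefficient +1 or -1."""
    return {tuple(h[i] for i in p): perm_sign(p)
            for p in permutations(range(len(h)))}

def normalized_integral(h, h_prime):
    """Integral of xi(h) * conj(xi(h')) over all angles, divided by (2*pi)^n."""
    a, b = elemental_sum(h), elemental_sum(h_prime)
    # Only terms with identical exponent tuples survive the integration.
    return sum(c * b[k] for k, c in a.items() if k in b)

delta_norm = normalized_integral((2, 1, 0), (2, 1, 0))   # Delta for n = 3
```

With strictly decreasing h the n! exponent tuples are all distinct, so the result is n! when the two sets of exponents agree and 0 otherwise.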

On setting the expansion (16.6) in the equation

$$\int \xi\bar{\xi}\,d\omega = V$$

we find $|c|^2 + |c'|^2 + \cdots = 1$. Since the $c, c', \cdots$ are
non-vanishing integers only the first term can appear in (16.6), and
we must have c = 1 or −1, and since the coefficient of the
highest term of $\xi$ (as of X) must be positive we are restricted to
the first alternative c = 1. We have thus shown that every
primitive characteristic is of the form

$$X = \frac{\xi(h_1, h_2, \cdots)}{\Delta} = \frac{|\,\varepsilon^{h_1}, \varepsilon^{h_2}, \cdots, \varepsilon^{h_n}\,|}{|\,\varepsilon^{n-1}, \cdots, \varepsilon, 1\,|} \qquad (16.9)$$

where the $h_i$ are integers arranged in decreasing order: $h_1 > h_2 > \cdots > h_n$.
The function defined by (16.9) is a finite Fourier series with
the highest term $(f_1, f_2, \cdots, f_n)$; the coefficient of this term, its
multiplicity, is 1.
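
Formula (16.9) can be tried out numerically for small n. The sketch below evaluates both determinants by summing over permutations at a set of arbitrarily chosen, distinct angles and compares the quotient with characteristics that are known independently (the trace of the vector representation, the determinant, and the symmetric tensors of order 2). The angles and function names are assumptions made for the illustration.

```python
import cmath
from itertools import permutations

def perm_sign(p):
    """Parity of a permutation, computed from its inversion count."""
    inv = sum(1 for i in range(len(p)) for j in range(i + 1, len(p)) if p[i] > p[j])
    return -1 if inv % 2 else 1

def elemental_sum_value(h, eps):
    """xi(h) as the determinant |eps_i ** h_j|, expanded over permutations."""
    total = 0
    for p in permutations(range(len(h))):
        term = perm_sign(p)
        for i, j in enumerate(p):
            term *= eps[i] ** h[j]
        total += term
    return total

def characteristic(f, eps):
    """X = xi(h) / Delta with h_i = f_i + (n - i), following (16.7) and (16.9)."""
    n = len(f)
    h = [f[i] + (n - 1 - i) for i in range(n)]
    delta = elemental_sum_value(list(range(n - 1, -1, -1)), eps)
    return elemental_sum_value(h, eps) / delta

omega = [0.3, 1.1, 2.0]                         # arbitrary distinct angles
eps = [cmath.exp(1j * w) for w in omega]

vector_rep = characteristic((1, 0, 0), eps)     # should equal eps_1 + eps_2 + eps_3
determinant_rep = characteristic((1, 1, 1), eps)
symmetric_square = characteristic((2, 0, 0), eps)
```

The angles must be distinct so that $\Delta$ does not vanish; at coinciding angles the quotient has to be evaluated as a limit, which is the situation met in computing the dimensionality.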

4. Completeness. The last question to be answered asks
whether every function of the form (16.9) is conversely the
characteristic of some irreducible representation of $\mathfrak{u}$ or not.
Our explicit algebraic construction allows us to answer this
question in the affirmative. To show this we first note that the
representation of order f arising from the symmetry pattern
with (at most n) rows of lengths $f_1, f_2, \cdots, f_n$ has as highest
weight $(f_1, f_2, \cdots, f_n)$; this can be seen immediately by
considering the representation as generated by alternation from
the product of n vectors, the first of which occurs $f_1$ times as
a factor, the second $f_2$, etc. (as in the simple case at the beginning
of § 15). The f are here any integers satisfying the conditions

$$f_1 \geq f_2 \geq \cdots \geq f_n \geq 0.$$

On dividing the transformation corresponding to the arbitrary
element s of $\mathfrak{u}$ in this representation by the l-th power of the
determinant of s (l being any fixed non-negative integer) the
highest weight of the resulting transformation is $(f_1 - l,\,
f_2 - l,\, \cdots,\, f_n - l)$; this simple device thus enables us to
dispense with the restriction $f_n \geq 0$. We have thus proved that

all irreducible unitary representations of the unitary group $\mathfrak{u}_n$
are obtainable by completely reducing the representations $(\mathfrak{u})^f$ for
f = 0, 1, 2, ··· into their irreducible constituents and multiplying
by the 1-dimensional representations

$$s \to (\det s)^l \qquad [l = 0, \pm 1, \pm 2, \cdots].$$

We have further shown that the characteristic of the irreducible
representation $\mathfrak{H} = \mathfrak{H}(f_1, f_2, \cdots, f_n)$ of order f of $\mathfrak{u}$, which is
generated by the symmetry pattern $P(f_1, f_2, \cdots, f_n)$, is given by equation
(16.9).

We could also have obtained this last result with the more
transcendental method of proof employed in steps 1 to 3. If
we are operating in the continuum of all complex numbers
rather than an arbitrary field the proof of the completeness of
the irreducible representations of a finite group can be formulated
in such a way that it can be taken over immediately for the case
of a closed continuous group with the aid of the theory of integral
equations. The particular application of this general
group-theoretic completeness theorem to the group $\mathfrak{d}_2$ of rotations of
a circle into itself yields the completeness of the Fourier
orthogonal system $e(m\omega)$ (m = 0, ±1, ±2, ···). Its application to
the closed group $\mathfrak{u}$ yields the following two facts: (1) Every
expression of the form (16.9) is in fact a primitive characteristic.
For if it were not it would be a non-vanishing function of position
on the group manifold (in fact, a class function) whose Fourier
coefficient with respect to each irreducible representation
vanishes; it is indeed orthogonal to all other functions of the
form (16.9). (2) We further find that the functions (16.9)
constitute a complete set of orthogonal functions for symmetric
periodic functions of $\omega_1, \omega_2, \cdots, \omega_n$; this result is of no particular
interest, as it is a consequence of the completeness of Fourier's
orthogonal system in one dimension. Our general considerations
(1) to (4) yielded so many properties of primitive characteristics
that we were able to obtain an explicit expression for them from
these properties alone.

Consequences. The assumption that $f_n \ge 0$ constitutes
no actual restriction; the characteristic is then a symmetric
rational integral function of the $\varepsilon$ of order f. The $\varepsilon$ are in fact
roots of the characteristic polynomial $\det(r\mathbf{1} - S)$ of
the unitary transformation S; it is therefore possible to express
X rationally and integrally in terms of the coefficients of this
polynomial, and therefore in terms of the coefficients of the
matrix S. The restriction to the unitary group can then readily
be removed, but we shall not go further into these considerations
here.

The dimensionality of the representation X is found by
calculating X for the unit element, all of whose characteristic
numbers $\varepsilon_\nu$ are 1. On substituting directly in (16.9) we obtain
the indeterminate form 0/0, so we proceed as follows. Take

$$\omega_1 = (n - 1)\omega, \quad \omega_2 = (n - 2)\omega, \quad \dots, \quad \omega_n = 0 \cdot \omega$$

in terms of the single angle $\omega$. The determinant in the numerator
of (16.9) is then the alternating sum of the terms obtained from
the product

$$e(h_1 (n - 1)\omega) \cdot e(h_2 (n - 2)\omega) \cdots e(h_n \cdot 0 \cdot \omega)$$

by permutations of the numbers $n - 1, n - 2, \dots, 0$; it is
therefore equal to

$$|(e(h\omega))^{n-1}, \dots, e(h\omega), 1|$$

or to the product of the differences of the expressions $e(h_1\omega)$,
$e(h_2\omega), \dots$ obtained by subtracting any member of the set from
any of the earlier members. On allowing $\omega \to 0$ we have

$$e(h_i\omega) - e(h_k\omega) \sim i\omega(h_i - h_k).$$

The denominator of (16.9) is the special case in which the h are
$n - 1, \dots, 1, 0$. The dimensionality N of the representation denoted by
$\mathfrak{H}(f_1, f_2, \dots, f_n)$ in the above is consequently

$$N = \frac{D(h_1, h_2, \dots, h_n)}{D(n - 1, \dots, 1, 0)}. \tag{16.10}$$

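The ratio of difference products in (16.10) is easy to evaluate mechanically. The following is a minimal sketch, assuming the convention $h_i = f_i + (n - i)$ used in this section; the function names `difference_product` and `unitary_dimension` are illustrative and do not come from the text.

```python
from itertools import combinations


def difference_product(values):
    """D(v_1, ..., v_n): product of v_i - v_k over all pairs with i < k."""
    result = 1
    for earlier, later in combinations(values, 2):
        result *= earlier - later
    return result


def unitary_dimension(f):
    """Dimension of the representation with highest weight f, following (16.10).

    f is a non-increasing sequence (f_1, ..., f_n); n is its length.
    """
    n = len(f)
    # h_i = f_i + (n - i) with 1-based i, i.e. f[i] + (n - 1 - i) with 0-based i
    h = [f_i + (n - 1 - i) for i, f_i in enumerate(f)]
    base = list(range(n - 1, -1, -1))  # n - 1, ..., 1, 0
    # The quotient is always an integer, so floor division is exact here.
    return difference_product(h) // difference_product(base)


print(unitary_dimension((1, 0)))     # the defining 2-dimensional representation
print(unitary_dimension((2, 1, 0)))  # an 8-dimensional representation for n = 3
```

For n = 2 the formula reduces to $N = h_1 - h_2 = f_1 - f_2 + 1$.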
Evaluation of the Characters of $\pi_f$. Having obtained explicit
expressions for the characteristics of the representations of $\mathfrak{u}_n$
we now employ the connection between the representations of
$\pi_f$ and $\mathfrak{u}_n$ developed in § 9 to evaluate the primitive characters
of $\pi_f$. In equation (9.12) $\chi$ is the character and X the characteristic
of the irreducible representations of $\pi_f$ and $\mathfrak{u}_n$, respectively,
generated by the symmetry pattern $P(f_1, f_2, \dots)$;
in particular we must put X = 0 if the pattern has more than
n rows. The sum is extended over all possible symmetry
patterns P with f fields. The expression (16.9) for X then allows
us to enunciate the following rule for the evaluation of $\chi$: Let

$$\chi_{f_1 f_2 \dots}(i_1 i_2 \dots) \tag{16.11}$$

denote the value of the character of the irreducible representation
$\mathfrak{h}(f_1, f_2, \dots)$ of $\pi_f$, which is generated by the symmetry pattern
$P(f_1, f_2, \dots)$, for an element s belonging to the class $\mathfrak{k} = (i_1 i_2 \dots)$.
Choose an arbitrary positive integer n and construct the sums
$\sigma_1, \sigma_2, \dots$ of powers of n independent variables $\varepsilon_1, \varepsilon_2, \dots, \varepsilon_n$
(that is, $\sigma_r = \varepsilon_1^r + \dots + \varepsilon_n^r$) and
the product $D(\varepsilon_1, \varepsilon_2, \dots, \varepsilon_n)$ of their differences. The term (16.11)
is then the coefficient of the term $\varepsilon_1^{h_1} \varepsilon_2^{h_2} \cdots \varepsilon_n^{h_n}$ $[h_i = f_i + (n - i)]$
in the expansion of

$$D(\varepsilon_1, \dots, \varepsilon_n) \cdot \sigma_1^{i_1} \sigma_2^{i_2} \cdots. \tag{16.12}$$

We here assume that the pattern P has at most n rows; hence
if we wish to obtain all primitive characters of $\pi_f$ we must choose
$n \ge f$. The rule shows that the components of the characters
are integers.

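The rule can be carried out literally by expanding the polynomial (16.12) and reading off one coefficient. The sketch below assumes a polynomial is stored as a dictionary from exponent tuples to integer coefficients; this representation and all function names are illustrative choices, not part of the text. The class is given as $(i_1, i_2, \dots)$, where $i_r$ is the number of cycles of length r, and n is taken equal to the number of rows of the pattern.

```python
def poly_mul(p, q):
    """Multiply two polynomials stored as {exponent tuple: coefficient}."""
    out = {}
    for exp_p, coeff_p in p.items():
        for exp_q, coeff_q in q.items():
            exp = tuple(a + b for a, b in zip(exp_p, exp_q))
            out[exp] = out.get(exp, 0) + coeff_p * coeff_q
    return out


def power_sum(n, r):
    """sigma_r = eps_1^r + ... + eps_n^r."""
    return {tuple(r if j == i else 0 for j in range(n)): 1 for i in range(n)}


def difference_product_poly(n):
    """D(eps_1, ..., eps_n) = product over i < k of (eps_i - eps_k)."""
    poly = {(0,) * n: 1}
    for i in range(n):
        for k in range(i + 1, n):
            factor = {
                tuple(1 if j == i else 0 for j in range(n)): 1,
                tuple(1 if j == k else 0 for j in range(n)): -1,
            }
            poly = poly_mul(poly, factor)
    return poly


def frobenius_character(pattern, cycle_type):
    """Coefficient of eps^h in D * sigma_1^{i_1} * sigma_2^{i_2} * ...

    pattern    = (f_1, f_2, ...), the row lengths of the symmetry pattern
    cycle_type = (i_1, i_2, ...), with i_r cycles of length r
    """
    n = len(pattern)
    h = tuple(f_i + (n - 1 - i) for i, f_i in enumerate(pattern))
    poly = difference_product_poly(n)
    for r, count in enumerate(cycle_type, start=1):
        for _ in range(count):
            poly = poly_mul(poly, power_sum(n, r))
    return poly.get(h, 0)


# Pattern (3, 1) of four fields on the class of single transpositions (i_1, i_2) = (2, 1)
print(frobenius_character((3, 1), (2, 1)))
```

The expansion grows quickly with n and f, so this direct approach is only meant for small patterns.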
This result was obtained by Frobenius in a purely algebraic
manner, without introducing the continuous group $\mathfrak{u}$.*® But
I believe that the real reason for the rule comes to light only
when we consider this connection between the groups $\pi_f$ and
$\mathfrak{u}_n$; in particular, it enables us to understand why a second
integer n in addition to f is involved.

The dimensionality g of $\mathfrak{h}(f_1, f_2, \dots)$ is obtained by substituting
the argument s = I: $i_1 = f$, $i_2 = i_3 = \dots = 0$ in the
character $\chi$. Formula (9.12) is then

$$\sigma_1^f = \sum g X,$$

where the sum is extended over all patterns $P(f_1, f_2, \dots)$. Since
$\sigma_1$ is the characteristic of the n-dimensional representation
$\mathfrak{c}$: S → S of the group $\mathfrak{u}$ by itself, this merely means that in
the complete reduction of $(\mathfrak{c})^f$ the irreducible representation
$\mathfrak{H} = \mathfrak{H}(f_1, f_2, \dots)$ appears exactly g times, as we already know.
On substituting the explicit expression (16.9) for X we obtain

$$\sigma_1^f \cdot |\varepsilon^{n-1}, \dots, \varepsilon, 1| = \sum g \cdot |\varepsilon^{h_1}, \varepsilon^{h_2}, \dots, \varepsilon^{h_n}|.$$

g is accordingly equal to the coefficient of $\varepsilon_1^{h_1} \varepsilon_2^{h_2} \cdots \varepsilon_n^{h_n}$ in the
expansion of the product on the left-hand side. The term
$\pm\, \varepsilon_1^{k_1} \varepsilon_2^{k_2} \cdots \varepsilon_n^{k_n}$ in the expansion of the determinant must
be multiplied by the term

$$\frac{f!}{(h_1 - k_1)!\,(h_2 - k_2)! \cdots}\; \varepsilon_1^{h_1 - k_1} \varepsilon_2^{h_2 - k_2} \cdots$$

of $\sigma_1^f$ in order to obtain a contribution to the term $\varepsilon_1^{h_1} \varepsilon_2^{h_2} \cdots$
of the product. $k_1, k_2, \dots, k_n$ here run through the permutations
of $n - 1, \dots, 1, 0$ and g is accordingly equal to the
alternating sum

$$\sum \pm \frac{f!}{(h_1 - k_1)!\,(h_2 - k_2)! \cdots}$$

over these permutations, or equal to the determinant

$$f!\, \left| \frac{1}{(h - n + 1)!}, \dots, \frac{1}{(h - 1)!}, \frac{1}{h!} \right| = \frac{f!}{h_1!\, h_2! \cdots h_n!}\, |h(h - 1) \cdots (h - n + 2), \dots, h, 1|.$$

The rows of this determinant consist, on reading from right to
left, of polynomials in h of degrees $0, 1, \dots, (n - 1)$ with highest
coefficient 1. The determinant is therefore

$$|h^{n-1}, \dots, h, 1| = D(h_1, h_2, \dots, h_n),$$

and we finally obtain the simple formula

$$g = \frac{f!\, D(h_1, h_2, \dots, h_n)}{h_1!\, h_2! \cdots h_n!}. \tag{16.13}$$

n is to be taken at least as large as the number of rows in the
pattern $P(f_1, f_2, \dots)$; the reader should convince himself by
direct calculation that the value of (16.13) remains unchanged
on replacing n by n + 1.

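Formula (16.13), and the remark that its value does not depend on the choice of n, can be checked numerically. The sketch below is only an illustration: the function name `symmetric_dimension` and the optional argument `n` are assumptions introduced here, and the pattern is padded with empty rows when n exceeds its number of rows.

```python
from itertools import combinations
from math import factorial


def symmetric_dimension(pattern, n=None):
    """g = f! * D(h_1, ..., h_n) / (h_1! ... h_n!), following (16.13)."""
    rows = list(pattern)
    if n is None:
        n = len(rows)
    if n < len(rows):
        raise ValueError("n must be at least the number of rows of the pattern")
    rows += [0] * (n - len(rows))  # pad with empty rows
    h = [f_i + (n - 1 - i) for i, f_i in enumerate(rows)]
    numerator = factorial(sum(rows))
    for earlier, later in combinations(h, 2):
        numerator *= earlier - later
    denominator = 1
    for h_i in h:
        denominator *= factorial(h_i)
    return numerator // denominator  # the quotient is an integer


# The value is the same for n = 2, 3, 4, 5
print([symmetric_dimension((3, 1), n) for n in (2, 3, 4, 5)])
```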
Frobenius' rule for the character and this formula for the
dimensionality are vastly superior to (14.7) for purposes of
practical evaluation.

As an example, we carry through the computations for the
case of four electrons; the results are given in the table below.
The group contains twenty-four elements which are divided
into five classes of conjugates; each of these classes is designated
in the second column of the table by the values $(i_1 i_2 \dots)$ associated
with it. The first column contains the number of
elements in each of these classes, and the sign + or - indicates
whether the class consists of even or odd permutations. Each
of the five remaining columns contains the values of a primitive
character for the classes in whose row they stand. The symmetry
pattern to which each of these characters belongs is indicated at
the head of the column by the numbers $f_1, f_2, \dots$ of elements in
its rows. The first and the last of these columns may be filled in
immediately, and the second and third with the aid of Frobenius'
rule. The fourth is then obtained from the second on noting
that its symmetry pattern is the dual of that of pattern 2; we need
then merely to replace the values in the second column by their
negative for the (-)-classes. Since patterns 2 and 3 contain
but two rows we may take n = 2. Hence on writing x, y in
place of $\varepsilon_1, \varepsilon_2$ we have merely to find the coefficients of $x^4 y$ (for
the column 31) and $x^3 y^2$ (for the column 22) in the following
polynomials:

$$(x - y)(x + y)^4,$$

$$(x - y)(x + y)^2 (x^2 + y^2) = (x + y)(x^2 - y^2)(x^2 + y^2) = (x + y)(x^4 - y^4),$$

$$(x - y)(x^2 + y^2)^2,$$

$$(x - y)(x + y)(x^3 + y^3) = (x^2 - y^2)(x^3 + y^3),$$

$$(x - y)(x^4 + y^4).$$

The dimensionalities of the five irreducible representations are
contained in the first row; they are 1, 3, 2, 3, 1. The verification
of the orthogonality relations is left to the reader.

 No. of                      Pattern
 elements   Class      4     31     22    211   1111

   1 +      4          1      3      2      3      1
   6 -      21         1      1      0     -1     -1
   3 +      02         1     -1      2     -1      1
   8 +      101        1      0     -1      0      1
   6 -      0001       1     -1      0      1     -1

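The orthogonality relations left to the reader can be confirmed directly from the table. The sketch below copies the class sizes and character values from the table above; the dictionary layout and the helper name `inner` are illustrative choices. Two distinct primitive characters should have inner product 0, and each should have inner product 1 with itself.

```python
from fractions import Fraction

# Class sizes in the order of the table rows: (4), (21), (02), (101), (0001)
CLASS_SIZES = [1, 6, 3, 8, 6]

# Character values, one list per symmetry pattern (table columns)
CHARACTERS = {
    "4":    [1, 1, 1, 1, 1],
    "31":   [3, 1, -1, 0, -1],
    "22":   [2, 0, 2, -1, 0],
    "211":  [3, -1, -1, 0, 1],
    "1111": [1, -1, 1, 1, -1],
}


def inner(a, b):
    """Mean value over the group of the product of two (real) characters."""
    total = sum(size * x * y for size, x, y in zip(CLASS_SIZES, a, b))
    return Fraction(total, sum(CLASS_SIZES))


for name_a, values_a in CHARACTERS.items():
    row = [str(inner(values_a, values_b)) for values_b in CHARACTERS.values()]
    print(name_a.ljust(5), " ".join(row))
```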

§ 17. Calculation of Volume on $\mathfrak{u}$

Consider the line elements going out from the unit point $\mathbf{1}$
on the group manifold $\mathfrak{u}$, i.e. the infinitesimal unitary transformations
$\delta S = \|\delta s_{\alpha\beta}\|$. We may take as the real components
of this “vector” the n quantities $\frac{1}{i}\, \delta s_{\alpha\alpha}$ and the real and
imaginary parts of the $n(n - 1)/2$ quantities $\delta s_{\alpha\beta}$ $(\alpha < \beta)$; the
total number of components is thus $n^2$, which is therefore the
dimensionality of the group manifold $\mathfrak{u}$. Now in a linear algebra
of this kind we may replace any two real quantities a, b by the
complex quantities $a + ib$, $-a + ib$ obtained from them by
a simple linear substitution; we may therefore replace the
real and imaginary parts of $\delta s_{\alpha\beta}$ $(\alpha < \beta)$ by $\delta s_{\alpha\beta}$ itself and

$$\delta s_{\beta\alpha} = -\overline{\delta s_{\alpha\beta}}.$$

On transporting such an infinitesimal vector to the point
S on the group manifold by a left-translation its terminus goes
into the point $S + dS = S(\mathbf{1} + \delta S)$, $dS = S \cdot \delta S$; we must
therefore consider the infinitesimal element $\delta S = S^{-1} dS$ as the
“vector” which leads from S to S + dS. Our definition of
volume on the group manifold [III, § 12] consisted in the
following: the parallelepiped defined by $n^2$ vectors $\delta S$ leading
from the fixed point S to the neighbouring points S + dS has
as volume the absolute value of the determinant formed from the
components of the vectors $\delta S$. In accordance with the above
remarks we may take as components of the vector $\delta S = \|\delta s_{\alpha\beta}\|$
the totality of coefficients $\delta s_{\alpha\beta}$ themselves.

Any S can be expressed in the form

$$S = U E U^{-1} \tag{17.1}$$

where E is a principal (diagonal) element of $\mathfrak{u}$ and U is unitary.
S is unchanged on multiplying U on the right by any principal
element. We employ a geometrical terminology which will
allow us to visualize our procedure by means of an analogy.
Two elements U, U' of $\mathfrak{u}$ which are right-equivalent with respect
to the group of principal elements: U' = UE, will be said to
“lie on the same vertical [U].” From the $n^2$-dimensional manifold
$\mathfrak{u}$ we obtain by projection the $(n^2 - n)$-dimensional manifold
$[\mathfrak{u}]$ of verticals [U] on considering all points of $\mathfrak{u}$ which
belong to the same vertical to be coincident. This process of
identifying equivalent elements was described in general in the
beginning of Chapter III; we had, in fact, already met it in I,
§ 1, in the special case of projection in affine space. We may now
consider U in (17.1) merely as a representative element of the
vertical [U]; on allowing [U] to run through the entire manifold
$[\mathfrak{u}]$ and the angles $\omega_\nu$ of E:

$$E = \begin{Vmatrix} e(\omega_1) & & \\ & \ddots & \\ & & e(\omega_n) \end{Vmatrix}$$

to vary independently over the complete range $0 \le \omega_\nu < 2\pi$
the element S defined by (17.1) describes the manifold $\mathfrak{u}$ exactly
n! times.

The vector $\delta U = U^{-1} dU$ leads from the point U of the vertical
[U] to the neighbouring point U + dU of the vertical [U + dU].
The totality of all points on [U + dU] which are in the neighbourhood
of U is given by expressions of the form

$$(U + dU)(\mathbf{1} + \delta E) = U + (dU + U\, \delta E)$$

where $\delta E$ is an arbitrary infinitesimal principal element with
coefficients $i\, \delta\omega_\nu$ on the principal diagonal; the corresponding
vectors are $\bar\delta U = \delta U + \delta E$. Since the terms in the principal
diagonal of $\delta U$ are pure imaginary, $\delta E$ may be uniquely determined
in such a way that all terms in the principal diagonal of
$\bar\delta U$ vanish; we call this transition from [U] to [U + dU] the
“horizontal transition from U.” The transition from some other
point UE of the vertical [U] to the point (U + dU)E of [U + dU]
is accomplished by means of the vector

$$\delta' U = E^{-1} \cdot \delta U \cdot E. \tag{17.2}$$

That this linear transformation (17.2) determined by E, which
sends $\delta U$ into $\delta' U$, is unimodular follows from our general remarks
concerning closed continuous groups, and can in this
case be readily verified by direct computation. Naturally this
same equation holds for the horizontal transitions $\bar\delta U$, $\bar\delta' U$ from
U, UE respectively:

$$\bar\delta' U = E^{-1} \cdot \bar\delta U \cdot E. \tag{17.3}$$

$n^2 - n$ horizontal vectors $\bar\delta U$ leading out from U determine an
infinitesimal “parallelogram” whose content is measured by
the absolute value of the determinant of the $n^2 - n$ components
$\bar\delta u_{\alpha\beta}$ $(\alpha \ne \beta)$ of the various vectors $\bar\delta U$. On allowing each point
U on the periphery of the parallelogram to describe the vertical
[U] we obtain a tube whose horizontal sections are parallelograms;
its projection on $[\mathfrak{u}]$ is the original element of volume,
the “parallelogram” defined by the $\bar\delta U$. Since the linear
transformation (17.3), $\bar\delta U \to \bar\delta' U$, is unimodular, the content of
each horizontal section is the same, and may therefore be considered
as the content of the volume element on $[\mathfrak{u}]$.

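We now examine the variations in [U] and E in (17.1) when
S goes over into S + dS. We have

$$S U = U E$$

and therefore

$$dS \cdot U + S \cdot dU = dU \cdot E + U \cdot dE.$$

On multiplying both sides of this equation on the left by
$U^{-1} S^{-1} = E^{-1} U^{-1}$ we find

$$U^{-1} \cdot \delta S \cdot U + \delta U = E^{-1} \cdot \delta U \cdot E + \delta E$$

or

$$\delta' S = U^{-1} \cdot \delta S \cdot U = \{E^{-1} \cdot \delta U \cdot E - \delta U\} + \delta E. \tag{17.4}$$

The components of the matrix contained in braces are

$$\left( \frac{\varepsilon_\beta}{\varepsilon_\alpha} - 1 \right) \delta u_{\alpha\beta};$$

they vanish for $\alpha = \beta$, so that only the horizontal part $\bar\delta U$ of
$\delta U$ enters.

We now define a parallelepiped at S which shall serve as a
volume element in the following manner: $n^2 - n$ of the $n^2$
sides $\delta S$ are obtained from (17.4) on allowing the angles of
rotation to remain fixed, i.e. $\delta E = 0$, and drawing $n^2 - n$ horizontal
vectors $\bar\delta U$ from the point U to form a volume element
of magnitude d[U] on $[\mathfrak{u}]$; the remaining n vectors $\delta S$ are then
chosen in such a way that for each of them one and only one of
the angles $\omega_\nu$ changes by $d\omega_\nu$, and [U] remains unchanged. The
corresponding vectors $\delta' S$ define, in accordance with (17.4),
an element of volume of magnitude

$$\prod_{\alpha \ne \beta} \left( \frac{\varepsilon_\beta}{\varepsilon_\alpha} - 1 \right) \cdot d[U] \cdot d\omega_1\, d\omega_2 \cdots d\omega_n. \tag{17.5}$$

Since the linear transformation $\delta S \to \delta' S = U^{-1} \cdot \delta S \cdot U$ is unimodular
this volume is equal to that of the element defined by
the $\delta S$ themselves. Since $\bar\varepsilon = 1/\varepsilon$ the product $\prod$ in (17.5) can
be written

$$\prod_{\alpha < \beta} (\varepsilon_\alpha - \varepsilon_\beta)(\bar\varepsilon_\alpha - \bar\varepsilon_\beta) = \Delta \cdot \bar\Delta.$$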

The final result is: The volume element described by S on allowing
[U] in (17.1) to describe an infinitesimal volume element of magnitude
d[U] on $[\mathfrak{u}]$ and on allowing the angles of rotation $\omega_\nu$ to vary
by $d\omega_\nu$ has the magnitude

$$\Delta \bar\Delta \cdot d\omega_1\, d\omega_2 \cdots d\omega_n \cdot d[U]. \tag{17.6}$$

On integrating with respect to d[U] over $[\mathfrak{u}]$ we obtain the
theorem, already applied in the preceding section, concerning
the magnitude of that portion of $\mathfrak{u}$ in which the angles of rotation
have values lying between $\omega_\nu$ and $\omega_\nu + d\omega_\nu$.

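The density $\Delta \bar\Delta$ in (17.6) can be examined numerically. Expanding $\Delta \bar\Delta$ as a trigonometric polynomial gives the constant term n!, so its integral over all angles should equal $n!\,(2\pi)^n$, matching the count of n! coverings mentioned above. The sketch below checks this for small n; the function names are illustrative, and the uniform grid is an assumption that works here because an equally spaced sum integrates low-order trigonometric polynomials exactly up to rounding.

```python
import cmath
import math
from itertools import product


def weyl_density(angles):
    """Delta * conj(Delta) = product over a < b of |e(w_a) - e(w_b)|^2."""
    eps = [cmath.exp(1j * w) for w in angles]
    value = 1.0
    for a in range(len(eps)):
        for b in range(a + 1, len(eps)):
            value *= abs(eps[a] - eps[b]) ** 2
    return value


def torus_integral(n, points=8):
    """Integrate the density over 0 <= w_nu < 2*pi with an equally spaced grid."""
    step = 2 * math.pi / points
    grid = [k * step for k in range(points)]
    total = sum(weyl_density(angles) for angles in product(grid, repeat=n))
    return total * step ** n


for n in (2, 3):
    print(n, torus_integral(n), math.factorial(n) * (2 * math.pi) ** n)
```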
These considerations remain valid on restricting ourselves
to the group $\mathfrak{u}$ of unitary transformations with determinant 1.
The angles of rotation are then subjected to the restriction

$$\omega_1 + \omega_2 + \dots + \omega_n = 0, \tag{17.7}$$

and the only difference in the result is that the factor $d\omega_n$ in
(17.6) is to be omitted. Condition (17.7) allows us to normalize
the linear form $h_1 \omega_1 + \dots + h_n \omega_n$ in the angles of rotation in
such a way that $h_n = 0$; the exponents $(h_1, h_2, \dots, h_n)$ in the
weights of the representations of $\mathfrak{u}$ are then non-negative integers.
It is desirable, however, not to impose this normalization $h_n = 0$;
we need then only to remark that only the differences between
the $h_i$ are of significance: the irreducible representations
$\mathfrak{H}(f_1, f_2, \dots, f_n)$ of $\mathfrak{u}$ are unchanged on increasing each of the $f_i$
by the same integer. In particular, these considerations justify
the expression used in Chapter III for the volume on the group
manifold of the unimodular unitary group $\mathfrak{u}_2$, and the results
of the preceding section constitute a direct proof, which is independent
of the completeness theorem, of the fact that the
representations of $\mathfrak{u}_2$ denoted by constitute a complete set of
inequivalent irreducible representations of $\mathfrak{u}_2$.

§ 18. Branching Laws

Finally, we show the usefulness of our formulae for the 
characters by deriving two simple “ branching laws ” from them. 

1. Branching law for the Permutation Group. 

The irreducible representation of $\pi_f$ with the symmetry pattern
$P(f_1, f_2, \dots)$ reduces, on restricting $\pi_f$ to the sub-group $\pi_{f-1}$ of
permutations of $f - 1$ things, into the sum of those irreducible
representations of $\pi_{f-1}$ associated with the patterns

$$P(f_1 - 1, f_2, f_3, \dots); \quad P(f_1, f_2 - 1, f_3, \dots); \quad \dots;$$

those patterns in which the rows are not arranged in decreasing
length are to be omitted. Each such constituent appears exactly
once. (In words, these patterns are obtained from the original
one by removing a field in turn from the end of each row which
is actually longer than the following one.)

Proof. Let s be a permutation of the numbers $1, 2, \dots, f - 1$
belonging to the class $(i_1 - 1, i_2, i_3, \dots)$. Considered as a
permutation of the f numbers $1, 2, \dots, f$, s leaves the last number
fixed; the number of one-term cycles is thus increased by 1,
and s, considered as an element of $\pi_f$, belongs to the class
$(i_1, i_2, i_3, \dots)$. In the expansion

$$\Delta \cdot \sigma_1^{i_1 - 1} \sigma_2^{i_2} \cdots = \sum a_{h'_1 h'_2 \dots}\, \varepsilon_1^{h'_1} \varepsilon_2^{h'_2} \cdots \tag{18.1}$$

we have as the coefficients of those terms for which
$h'_1 \ge h'_2 \ge \dots$

$$a_{h'_1 h'_2 \dots} = 0 \quad \text{or} \quad \chi'_{f'_1 f'_2 \dots} \tag{18.2}$$

according as any of the signs $\ge$ in the above inequalities is
actually = or not. $\chi'$ is the primitive character of $\pi_{f-1}$ belonging
to the symmetry pattern $P(f'_1, f'_2, \dots)$. On the other hand,
the coefficient of $\varepsilon_1^{h_1} \varepsilon_2^{h_2} \cdots$ $[h_1 > h_2 > \dots]$ in $\Delta \cdot \sigma_1^{i_1} \sigma_2^{i_2} \cdots$ is
equal to the character $\chi_{f_1 f_2 \dots}$ of the representation of $\pi_f$
with pattern $P(f_1, f_2, \dots)$. Hence on multiplying (18.1) with
$\sigma_1 = \varepsilon_1 + \varepsilon_2 + \dots + \varepsilon_n$ we find

$$\chi_{f_1 f_2 \dots} = a_{h_1 - 1,\, h_2,\, \dots} + a_{h_1,\, h_2 - 1,\, \dots} + \cdots.$$

Our branching law follows from this result and (18.2). The
branching law leads to a recurrence formula for the dimensionalities
$g(f_1, f_2, \dots)$.
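The recurrence for the dimensionalities mentioned at the end of the proof can be sketched as follows: the dimension of a pattern is the sum of the dimensions of the patterns obtained by removing one admissible field. The function names `restrict` and `dimension` are illustrative, and patterns are written as tuples of row lengths.

```python
from functools import lru_cache


def restrict(pattern):
    """Patterns of pi_{f-1} occurring in the restriction of the given pattern.

    A field is removed from the end of each row that is actually longer
    than the following one; empty rows are dropped from the result.
    """
    rows = list(pattern)
    result = []
    for i, length in enumerate(rows):
        following = rows[i + 1] if i + 1 < len(rows) else 0
        if length > following:
            smaller = rows[:i] + [length - 1] + rows[i + 1:]
            result.append(tuple(r for r in smaller if r > 0))
    return result


@lru_cache(maxsize=None)
def dimension(pattern):
    """Dimensionality g obtained by applying the branching law repeatedly."""
    if sum(pattern) <= 1:
        return 1
    return sum(dimension(smaller) for smaller in restrict(pattern))


print(restrict((3, 1)))   # the two patterns with three fields
print(dimension((3, 1)))  # agrees with the table for four fields
```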
2. Branching law for $\mathfrak{c}_n$.

On restricting $\mathfrak{c}_n$ to the sub-group $\mathfrak{c}_{n-1}$ of linear transformations of
an $(n - 1)$-dimensional sub-space the irreducible representation
$(f_1, f_2, \dots, f_n)$ of $\mathfrak{c}_n$ reduces into the sum of all those representations
$(f'_1, f'_2, \dots, f'_{n-1})$ of $\mathfrak{c}_{n-1}$ for which

$$f_1 \ge f'_1 \ge f_2 \ge f'_2 \ge \dots \ge f'_{n-1} \ge f_n; \tag{18.3}$$

each of these constituents appears exactly once.

Proof. The linear transformations of the sub-space
$x_n = 0$ are simply isomorphic to those linear transformations
S of the variables $x_1, x_2, \dots, x_n$ for which $x_n \to x_n$. Hence $\varepsilon_n$
is to be replaced by 1 in the characteristic (16.9). The denominator
is then

$$D(\varepsilon_1, \varepsilon_2, \dots, \varepsilon_{n-1}) \cdot (\varepsilon_1 - 1)(\varepsilon_2 - 1) \cdots (\varepsilon_{n-1} - 1),$$

as can be seen by subtracting the last column of $D(\varepsilon_1, \varepsilon_2, \dots,
\varepsilon_{n-1}, 1)$ from each of the previous ones and factoring the resulting
$(n - 1)$-row determinant. In order to divide the determinant
in the numerator by the factor $(\varepsilon_1 - 1)(\varepsilon_2 - 1) \cdots (\varepsilon_{n-1} - 1)$
we subtract the second column from the first, the third from the
second, ..., and finally the n-th from the $(n - 1)$-th. The last
row then is $0, 0, \dots, 0, 1$; the determinant is thus reduced to
a determinant of order $(n - 1)$. Now divide each element in
the $\nu$-th row by $\varepsilon_\nu - 1$ in accordance with

$$\frac{\varepsilon^{h_1} - \varepsilon^{h_2}}{\varepsilon - 1} = \varepsilon^{h_1 - 1} + \dots + \varepsilon^{h_2}.$$

The result is that we then have in the numerator the determinant

$$|\varepsilon^{h_1 - 1} + \dots + \varepsilon^{h_2},\; \varepsilon^{h_2 - 1} + \dots + \varepsilon^{h_3},\; \dots| \qquad (\varepsilon = \varepsilon_1, \varepsilon_2, \dots, \varepsilon_{n-1}).$$

But this is the sum of all $(n - 1)$-rowed determinants of the form
$|\varepsilon^{h'_1}, \varepsilon^{h'_2}, \dots, \varepsilon^{h'_{n-1}}|$ for which

$$h_1 > h'_1 \ge h_2 > h'_2 \ge h_3 > \dots > h'_{n-1} \ge h_n. \tag{18.4}$$

On subtracting $n - 1$ from $h_1$, $n - 2$ from $h'_1$ and $h_2$, ..., 0
from $h'_{n-1}$ and $h_n$, in order to obtain the numbers f [(16.7)], the
inequalities (18.4) become the inequalities (18.3) and our theorem
is proved.
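A quick consistency check of this branching law is that dimensions must add up: the dimension of $(f_1, \dots, f_n)$ given by (16.10) should equal the sum of the dimensions of all constituents allowed by (18.3). The sketch below enumerates the interlacing sequences and compares the two numbers; the function names are illustrative and the dimension routine simply restates (16.10).

```python
from itertools import combinations, product


def dimension(f):
    """Dimension from (16.10) with h_i = f_i + (n - i)."""
    n = len(f)
    h = [f_i + (n - 1 - i) for i, f_i in enumerate(f)]
    base = list(range(n - 1, -1, -1))
    top = bottom = 1
    for earlier, later in combinations(h, 2):
        top *= earlier - later
    for earlier, later in combinations(base, 2):
        bottom *= earlier - later
    return top // bottom


def interlacing(f):
    """All (f'_1, ..., f'_{n-1}) with f_1 >= f'_1 >= f_2 >= ... >= f'_{n-1} >= f_n."""
    ranges = [range(f[i + 1], f[i] + 1) for i in range(len(f) - 1)]
    return [tuple(choice) for choice in product(*ranges)]


weight = (2, 1, 0)
parts = interlacing(weight)
print(parts)
print(dimension(weight), sum(dimension(p) for p in parts))
```

Since each $f'_i$ ranges independently between $f_{i+1}$ and $f_i$, the admissible sequences form a simple Cartesian product.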
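APPENDIX 1

Proof of an Inequality

(Page 77.)

In order to prove the inequality stated on page 77 we must
show that any continuous and differentiable function $\psi$, which
is defined for all values of the real variable x, satisfies the
condition

$$\left( \int_{-\infty}^{+\infty} \bar\psi \psi\, dx \right)^2 \le 4 \int_{-\infty}^{+\infty} x^2 \bar\psi \psi\, dx \cdot \int_{-\infty}^{+\infty} \frac{d\bar\psi}{dx} \frac{d\psi}{dx}\, dx \tag{*}$$

provided, of course, that the integrals involved actually exist.
The Schwarz inequality

$$|a_1 b_1 + \dots + a_n b_n|^2 \le (a_1 \bar a_1 + \dots + a_n \bar a_n)(b_1 \bar b_1 + \dots + b_n \bar b_n)$$

employed in Chapter I becomes, on replacing the sums by integrals,
or rather each sum by two integrals,

$$\left| \int f_1 g_1\, dx + \int f_2 g_2\, dx \right|^2 \le \left( \int f_1 \bar f_1\, dx + \int f_2 \bar f_2\, dx \right) \left( \int g_1 \bar g_1\, dx + \int g_2 \bar g_2\, dx \right).$$

Applying this inequality to

$$\int x\, \frac{d}{dx}(\bar\psi \psi)\, dx = \int x \left( \psi \frac{d\bar\psi}{dx} + \bar\psi \frac{d\psi}{dx} \right) dx$$

by taking

$$f_1 = x\psi, \quad f_2 = x\bar\psi, \quad g_1 = \frac{d\bar\psi}{dx}, \quad g_2 = \frac{d\psi}{dx},$$

and transforming the integral

$$\int x\, \frac{d}{dx}(\bar\psi \psi)\, dx \quad \text{into} \quad -\int \bar\psi \psi\, dx$$

by partial integration over the range $-\infty, +\infty$, we obtain the
desired relation (*) provided the term $x \bar\psi \psi$, which is integrated
out, approaches 0 as $x \to \pm\infty$. That this is actually the case
if the two integrals on the right of (*) converge can be seen by
the following indirect proof. Let $\varepsilon$ be any pre-assigned positive
constant and consider a positive value of x for which $x |\psi(x)|^2 > \varepsilon$
and which is so large that

$$\int_x^\infty \left| \frac{d\psi}{dx} \right|^2 dx \le \frac{1}{4}.$$

The Schwarz inequality then tells us that for $x \le x' \le x + \frac{\varepsilon}{x}$

$$|\psi(x') - \psi(x)|^2 \le \frac{1}{4} \cdot \frac{\varepsilon}{x}, \quad \text{whence} \quad |\psi(x')| \ge |\psi(x)| - \frac{1}{2} \sqrt{\frac{\varepsilon}{x}} > \frac{1}{2} \sqrt{\frac{\varepsilon}{x}}.$$

The integral of $x^2 |\psi|^2$ over the range from x to $x + \frac{\varepsilon}{x}$ is then

$$> x^2 \cdot \frac{1}{4} \cdot \frac{\varepsilon}{x} \cdot \frac{\varepsilon}{x} = \frac{\varepsilon^2}{4}.$$

Hence it follows that conversely

$$\int_x^\infty \left| \frac{d\psi}{dx} \right|^2 dx \le \frac{1}{4}, \qquad \int_x^\infty x^2 |\psi|^2\, dx \le \frac{\varepsilon^2}{4}$$

imply the inequality

$$x |\psi(x)|^2 \le \varepsilon.$$

Both conditions hold for all sufficiently large x when the two
integrals converge, so that $x |\psi(x)|^2$ indeed approaches 0.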

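The inequality (*) can be illustrated numerically for real test functions. The sketch below uses a simple midpoint rule on a finite interval, which is an assumption that is adequate only for rapidly decaying functions such as the two Gaussian-type examples chosen here; the function names are illustrative. For $\psi = e^{-x^2/2}$ the two sides agree, both being equal to $\pi$, while for $\psi = x e^{-x^2/2}$ the inequality is strict.

```python
import math


def integrate(func, lower=-12.0, upper=12.0, steps=4000):
    """Composite midpoint rule on [lower, upper]."""
    width = (upper - lower) / steps
    return width * sum(func(lower + (k + 0.5) * width) for k in range(steps))


def both_sides(psi, dpsi):
    """Left and right sides of (*) for a real function psi with derivative dpsi."""
    norm = integrate(lambda x: psi(x) ** 2)
    second_moment = integrate(lambda x: x * x * psi(x) ** 2)
    gradient = integrate(lambda x: dpsi(x) ** 2)
    return norm ** 2, 4 * second_moment * gradient


def gaussian(x):
    return math.exp(-x * x / 2)


def gaussian_derivative(x):
    return -x * math.exp(-x * x / 2)


def odd_function(x):
    return x * math.exp(-x * x / 2)


def odd_function_derivative(x):
    return (1 - x * x) * math.exp(-x * x / 2)


print(both_sides(gaussian, gaussian_derivative))
print(both_sides(odd_function, odd_function_derivative))
```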


APPENDIX 2 

A Composition Property of Group Characters 

(Page 169.)

The fundamental property of the irreducible representation
$\mathfrak{H}$: s → U(s) which is expressed in the equation

$$U(st) = U(s) U(t)$$

is paralleled by the relation

$$\chi(s) \chi(t) = \frac{g}{h} \sum_r \chi(s r^{-1} t r), \tag{*}$$

where g is the dimensionality of the representation and h the
order of the group.

Proof. If x, y are two elements of the algebra of the group,
the second of which belongs to the central, and if

$$x \to X, \quad y \to Y \quad \text{in } \mathfrak{H},$$

then $Y = \frac{\eta}{g} \mathbf{1}$, where $\xi$ and $\eta$ denote the traces of X and Y. The
matrix associated with z = xy in $\mathfrak{H}$ is $\frac{\eta}{g} X$ and its trace is $\frac{\xi\eta}{g}$:

$$\sum_r z(r) \chi(r) = \frac{1}{g} \sum_s x(s) \chi(s) \cdot \sum_t y(t) \chi(t).$$

On setting

$$z(r) = \sum x(s) y(t) \qquad [st = r]$$

we find

$$\sum_{s,t} x(s)\, y(t)\, \chi(st) = \frac{1}{g} \sum_{s,t} x(s)\, y(t)\, \chi(s)\, \chi(t).$$

Since y(t) depends only on the class of conjugate elements to
which t belongs we may replace

$$\chi(st) \quad \text{by} \quad \frac{1}{h} \sum_r \chi(s r^{-1} t r)$$

on the left-hand side of the previous equation. Then the coefficient
of x(s)y(t) on either side of the equation depends only
on the class to which the element t belongs, and since x(s) is an
arbitrary function, y(t) an arbitrary class function, the assertion
(*) follows from the fact that the two coefficients must agree.

We have omitted mention of this equation (*) in the text
in order not to interrupt the systematic development of the
theory of representations, which is completely described by the
orthogonality relations and the completeness theorem.
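Relation (*) can be verified by brute force on a small group. The sketch below uses the permutation group of three things and its 2-dimensional primitive character (number of fixed points minus 1); this choice of example, the tuple encoding of permutations, and the function names are all assumptions made for illustration. Here g = 2 and h = 6.

```python
from fractions import Fraction
from itertools import permutations

GROUP = list(permutations(range(3)))  # the six permutations of 0, 1, 2
G_DIM = 2                             # dimensionality g of the representation
H_ORDER = len(GROUP)                  # order h of the group


def compose(a, b):
    """Product a*b, acting as (a*b)(i) = a(b(i))."""
    return tuple(a[b[i]] for i in range(len(b)))


def inverse(a):
    inv = [0] * len(a)
    for i, image in enumerate(a):
        inv[image] = i
    return tuple(inv)


def chi(p):
    """Character of the 2-dimensional representation: fixed points minus 1."""
    return sum(1 for i, image in enumerate(p) if i == image) - 1


def averaged(s, t):
    """Right-hand side of (*): (g/h) * sum over r of chi(s r^-1 t r)."""
    total = sum(
        chi(compose(s, compose(inverse(r), compose(t, r)))) for r in GROUP
    )
    return Fraction(G_DIM, H_ORDER) * total


mismatches = [
    (s, t) for s in GROUP for t in GROUP if chi(s) * chi(t) != averaged(s, t)
]
print(len(mismatches))
```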
APPENDIX 3

A Theorem Concerning Non-degenerate Anti-symmetric
Bilinear Forms

(Page 274.)

We consider the given non-degenerate anti-symmetric bi-linear
form

$$\sum_{i,k=1}^{f} a_{ik} x_i y_k \qquad [a_{ki} = -a_{ik}]$$

as the “anti-symmetric product” $[\mathfrak{x}\mathfrak{y}]$ of the two vectors
$\mathfrak{x} = (x_1, x_2, \dots, x_f)$ and $\mathfrak{y} = (y_1, y_2, \dots, y_f)$. Let $\mathfrak{e}_1$ be any non-vanishing
vector; then by assumption $[\mathfrak{e}_1 \mathfrak{x}]$ cannot vanish identically in
$\mathfrak{x}$, and consequently a second vector $\mathfrak{e}_2$ can be found such that
$[\mathfrak{e}_1 \mathfrak{e}_2] = 1$. The simultaneous equations

$$[\mathfrak{e}_1 \mathfrak{x}] = 0, \quad [\mathfrak{e}_2 \mathfrak{x}] = 0$$

then have $f - 2$ linearly independent solutions $\mathfrak{e}_3, \dots, \mathfrak{e}_f$. These
vectors are furthermore such that no linear dependence can
exist between them and $\mathfrak{e}_1$, $\mathfrak{e}_2$, for if

$$\mathfrak{x} = \xi_1 \mathfrak{e}_1 + \xi_2 \mathfrak{e}_2 + \dots + \xi_f \mathfrak{e}_f = 0$$

it follows on building the anti-symmetric products $[\mathfrak{e}_1 \mathfrak{x}] = \xi_2$,
$[\mathfrak{e}_2 \mathfrak{x}] = -\xi_1$ that $\xi_1 = \xi_2 = 0$. We may therefore choose
$\mathfrak{e}_1, \mathfrak{e}_2, \dots, \mathfrak{e}_f$ as a co-ordinate system, i.e. as a basis from which
all vectors may be constructed. Let the anti-symmetric product
$[\mathfrak{x}\mathfrak{y}]$ be expressed in terms of the components $\xi_i$, $\eta_k$ of $\mathfrak{x}$, $\mathfrak{y}$ in
this new co-ordinate system by

$$[\mathfrak{x}\mathfrak{y}] = \sum_{i,k=1}^{f} \gamma_{ik} \xi_i \eta_k.$$

The manner in which the new fundamental vectors were determined
requires that the coefficients satisfy

$$\gamma_{11} = 0, \quad \gamma_{12} = 1; \quad \gamma_{13} = 0, \; \dots, \; \gamma_{1f} = 0,$$

$$\gamma_{21} = -1, \quad \gamma_{22} = 0; \quad \gamma_{23} = 0, \; \dots, \; \gamma_{2f} = 0.$$

In consequence of the anti-symmetry all $\gamma_{i1}$, $\gamma_{i2}$ with $i = 3, \dots, f$
vanish, and the matrix of the $\gamma_{ik}$ is completely reduced into the
2-rowed square sub-matrix

     0   1
    -1   0

and an $(f - 2)$-dimensional anti-symmetric matrix. Mathematical
induction with respect to the dimensionality f yields the
desired theorem that f is necessarily even and that the original
form can be transformed into

$$(\xi_1 \eta_2 - \xi_2 \eta_1) + (\xi_3 \eta_4 - \xi_4 \eta_3) + \cdots \qquad (f/2 \text{ terms})$$

by an appropriate linear transformation.
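The inductive construction in this proof translates into a short procedure: pick a vector, find a partner with product 1, then correct the remaining vectors so that their products with the pair vanish. The sketch below works over the rationals with `fractions.Fraction`; the function names, the use of the standard basis as starting vectors, and the correction formula $\mathfrak{x}' = \mathfrak{x} + [\mathfrak{e}_2 \mathfrak{x}]\, \mathfrak{e}_1 - [\mathfrak{e}_1 \mathfrak{x}]\, \mathfrak{e}_2$ are choices made here for illustration.

```python
from fractions import Fraction


def form(a, x, y):
    """Anti-symmetric product [x y] = sum over i, k of a[i][k] * x[i] * y[k]."""
    size = len(x)
    return sum(a[i][k] * x[i] * y[k] for i in range(size) for k in range(size))


def symplectic_basis(a):
    """Basis e_1, e_2, ... in which the form has blocks (0 1 / -1 0) on the diagonal."""
    f = len(a)
    remaining = [[Fraction(int(i == k)) for k in range(f)] for i in range(f)]
    basis = []
    while remaining:
        e1 = remaining.pop(0)
        partner = next(
            (idx for idx, v in enumerate(remaining) if form(a, e1, v) != 0), None
        )
        if partner is None:
            raise ValueError("the form is degenerate")
        v = remaining.pop(partner)
        scale = form(a, e1, v)
        e2 = [c / scale for c in v]  # now [e1 e2] = 1
        # Replace every remaining x by a solution of [e1 x] = 0, [e2 x] = 0.
        remaining = [
            [
                x_c + form(a, e2, x) * c1 - form(a, e1, x) * c2
                for x_c, c1, c2 in zip(x, e1, e2)
            ]
            for x in remaining
        ]
        basis += [e1, e2]
    return basis


A = [
    [0, 2, 1, 0],
    [-2, 0, 0, 3],
    [-1, 0, 0, 1],
    [0, -3, -1, 0],
]
B = symplectic_basis(A)
GRAM = [[form(A, u, v) for v in B] for u in B]
for row in GRAM:
    print([str(entry) for entry in row])
```

For a form of odd dimensionality the procedure ends with a single unpaired vector and reports degeneracy, in line with the statement that f is necessarily even.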
theory refers to the more general case in which the perturbation
function W also depends explicitly on the time t; equation (8.1)
is valid in any case. See M. Born's investigation of the adiabatic
principle in quantum mechanics, Zeits. f. Phys. 40, 167 (1927),
and H. Weyl, l.c. (‘®). E. Fermi and F. Persico, Rend. Acc. d.
Lincei (6) 4, 452 (1926); M. Born and V. Fock, Zeits. f. Phys.
51, 165 (1928).

(23) 95. The recognition of the non-commutativity of multiplication
and the discovery of these commutation rules was a most important
step in Heisenberg's first paper and in the further
development of the new quantum mechanics in the papers by
Born, Heisenberg and Jordan cited in (*).

(24) 96. An excellent account of the Hamilton-Jacobi theory of
dynamics and of the perturbation theory of classical mechanics
is to be found in the chapters of Geiger and Scheel's Handbuch
der Physik on these subjects by L. Nordheim and E. Fues: Vol.
V, Chaps. III and IV. The English reader may refer to the book
by M. Born cited in (*). For canonical transformations in quantum
mechanics see P. Jordan, Zeits. f. Phys. 37, 383; 38, 513 (1926);
F. London, Zeits. f. Phys. 40, 193 (1926); P. A. M. Dirac, Proc.
Roy. Soc. 113 (A), 621 (1927).

(25) 100. H. Weyl, Raum-Zeit-Materie, 6th ed., §§ 40, 41 (Berlin
1923); or the English translation by H. L. Brose, Space, Time
and Matter, § 35 (London 1922). E. Schrödinger, Zeits. f. Phys.
12, 13 (1922). F. London, Zeits. f. Phys. 42, 376 (1927).

(26) 102. Collected Papers, p. 76. New data by J. S. Foster and 
L. Chalk, Proc. Roy. Soc. 123 (A), 108 (1929). 

(27) 104. This result is easily obtained by elementary methods for
a rectangular parallelepiped. For the general proof see H. Weyl,
Journ. f. d. reine u. angew. Math. 141, 163; 143, 177 (1912-13);
Rend. d. Circ. Mat. Palermo, 39, 1 (1915). R. Courant has carried
over the method from integral to differential equations: see
Chap. VI in Courant-Hilbert, Methoden der mathematischen
Physik 1.

(28) 104. P. A. M. Dirac, Proc. Roy. Soc. 114 (A), 243 (1927). In
addition to this paper on emission and absorption see also the one
on dispersion to be found on p. 710 of the same volume. For
Jeans' treatment of black body radiation, which led to the
Rayleigh-Jeans radiation law, see J. H. Jeans, Phil. Mag. 10 (6),
91 (1905). P. Debye, Ann. d. Phys. (4), 33, 1427 (1910), introduced
the quantum of action into this theory.

(29) 109. Led by arguments of a general statistical nature, Einstein 
had recognized the necessity for introducing stimulated emission 
long before the development of the new quantum mechanics and 
had derived equations (13.9), (13.10) : Phys. Zeits. 18, 121 (1917). 
The new quantum mechanics completes the derivation by obtaining 
the probability coefficient A, eq. (13.8), from the structure of the 
atom. 

(30) 109. V. Weisskopf and E. Wigner, Zeits. f. Phys. 63, 64 (1930). 


CHAPTER III 

(1) 110. For the general foundations of the theory of groups and 
the development of the theory of finite groups see : W. Burnside, 
Theory of Groups of Finite Order, 2nd ed. (Cambridge 1911) ; 
G. A. Miller, H. F. Blichfeldt and L. E. Dickson, Theory and 
Applications of Finite Groups (New York 1916) ; A. Speiser, 
Theorie der Gruppen von endlicher Ordnung, 2nd ed. (Berlin 1927). 

(2) 112. Vergleichende Betrachtungen über neuere geometrische
Forschungen (Erlangen 1872); Math. Ann. 43, 63 (1893); or
F. Klein, Gesammelte mathematische Abhandlungen, Vol. I, 460
(Berlin 1921).

(3) 120. Following the fundamental results of T. Molien on the theory
of hyper-complex numbers (Math. Ann. 41 and 42, 1893), the
theory of representations of finite groups was developed principally
by G. Frobenius (Sitzungsber. Preuss. Akad. 1896-99). The
most important general results were re-discovered by Burnside;
cf. his book cited in (1) above. The method developed by I. Schur,
Neue Begründung der Theorie der Gruppencharaktere, Sitzungsber.
Preuss. Akad. 1905, 406, is particularly recommended for its
clarity.

(4) 134. The development of § 6 follows E. Noether, Math. Zeits.
30, 641 (1929), in particular §§ 3 and 16. The uniqueness of complete
reduction rather than reduction follows in general W. Krull,
Math. Zeits. 23, 161 (1925); O. Schmidt, Math. Zeits. 29, 34
(1928); R. Brauer and I. Schur, Sitzungsber. Preuss. Akad. 1930,
209.

(5) 152. Schur's treatment of the theory of representations, cited in
(3), is based on this lemma.

(6) 153. W. Burnside, Proc. Lond. Math. Soc. (2), 3, 430 (1905). 

(7) 156. G. Frobenius and I. Schur, Sitzungsber. Preuss. Akad.
1906, 209.

(8) 161. The method of integration over the group manifold is due to
A. Hurwitz, Gött. Nachr. 1897, 71, although it was applied by him
to the theory of invariants rather than to the theory of groups.
I. Schur first obtained the orthogonality properties of the
characteristics of the continuous rotation group in this way and
used them to prove the completeness of the system of known
representations: Sitzungsber. Preuss. Akad. 1924, 189, 297, and
346.

(9) 166. For a modern book on algebra see L. E. Dickson, Algebras
and their Arithmetics (Chicago 1923); the German edition,
Algebren und ihre Zahlentheorie (trans. by J. J. Burckhardt and
E. Schubarth, Zurich 1927), follows an author's revision which has
not appeared in English. Also B. L. van der Waerden, Moderne
Algebra II (Berlin 1931). An algebra was previously called a
“system of hyper-complex numbers,” and still is, to some
extent, in the German literature; the algebra of a group is there
referred to as a “Gruppenring.” The usual procedure in modern
algebra consists in reducing the algebra into simple matric
algebras, in which case the theorems on realization by linear
transformations appear as corollaries; this development will be followed
in Chap. V.

(10) 173. See R. Weitzenböck, Invariantentheorie (Groningen 1923).
The foundation for the proof of the fundamental theorem of the
theory of invariants is the Hilbert basis theorem: D. Hilbert,
Math. Ann. 36, 473 (1890). The author has shown (Math. Zeits.
24, 392, 1926) that the fundamental theorem is valid for any closed
and for any semi-simple continuous group. The older theory of
invariants was almost exclusively concerned with the group $\mathfrak{c}_n$
of all linear transformations with unit determinant. A really
modern book on the theory of invariants is lacking.

(11) 175. The theory has been presented by S. Lie himself, with the
assistance of F. Engel, in a huge three-volume work: Theorie der
Transformationsgruppen (Leipsic 1893, 1930). See also S. Lie,
Vorlesungen über kontinuierliche Gruppen, ed. by G. Scheffers
(Leipsic 1893) and the brief presentation in H. Weyl, Mathematische
Analyse des Raumproblems, 5th lecture and Appendix 8
(Berlin 1923). The exclusively English reader may be referred
to J. E. Campbell, Introductory Treatise on Lie's Theory of Finite
Continuous Transformation Groups (Oxford 1903).

(12) 180. E. Cartan, Bull. Soc. Math. d. France 41, 53 (1913). See 
also H. Weyl, Math. Zeits. 23, 275 (1925) ; M. Born and 
P. Jordan, Elementare Quantenmechanik, Chap. IV. 

(13) 181. For the more profound theory of ray representations see 
I. Schur, Journ. f. d. reine u. angew. Math. 127, 20 ; 132, 85 ; 138,
155 (1904-11). 

(14) 184. This theorem is contained in my investigations on the
representations of semi-simple groups : Math. Zeits. 23, 271 ; 24,
328, 377 and 789 (1925-26). To this type of group belong : the
groups Cn of linear transformations with unit determinant, the
rotation groups bn and the “complex group” of all linear trans-
formations which leave a non-degenerate anti-symmetric bi-linear
form in two arbitrary vectors in a (2n)-dimensional space in-
variant. The first and second of the above papers are concerned
with these most important cases. The topological investigation
of the rotation group is to be found in Chap. II, § 5 (24, 346).


CHAPTER IV 

(1) 191. The theory of atomic spectra, which is developed in this and
the following chapter, is to be compared constantly with the
empirical data ; in particular see the books by Hund, Pauling-
Goudsmit and Grotrian cited in the Introduction. The applica-
tion of the theory of representations of the 3-dimensional rotation
group to atomic spectra is treated by E. Wigner, Zeits. f. Phys.
43, 624 (1927) ; J. v. Neumann and E. Wigner, Zeits. f. Phys.
47, 203 ; 49, 73 (1928). The subject has also been treated system-
atically recently by E. Wigner : Gruppentheorie und ihre Anwen-
dung auf die Quantenmechanik der Atomspektren (Braunschweig
1931) ; for a report on the subject see C. Eckart, Application of
Group Theory to the Quantum Dynamics of Monatomic Systems,
Rev. Mod. Phys. 2, 305 (1930). The inner quantum number was
introduced, on the basis of the empirical data, by A. Sommerfeld,
Ann. d. Phys. 63, 221 (1920) ; 70, 32 (1923).

(2) 191. The theory of the terms of diatomic molecules is treated in
the following fundamental papers by F. Hund : Zeits. f. Phys.
36, 657 (1926) ; 40, 742 (1927) ; 42, 93 (1927) ; 43, 805 (1927) ;
51, 759 (1928) ; 63, 719 (1930). Further see : R. S. Mulliken,
Phys. Rev. 32, 186 and 761 (1928) ; 36, 699 and 1440 (1930).
M. Born and J. R. Oppenheimer, Ann. d. Phys. (4) 84, 457 (1927).
E. U. Condon, Phys. Rev. 28, 1182 (1926) ; 32, 858 (1928). A
series of reports and discussions on this subject is to be found in
Trans. Faraday Soc. 25, 611-949 (1929) ; for a detailed report on
the entire field of molecular spectra see R. S. Mulliken, Rev.
Mod. Phys. 2, 60 (1930) ; 3, 89 (1931) ; see also the text by Ruark
and Urey cited in the Introduction.

(3) 191. W. Elert, Zeits. f. Phys. 51, 8 (1928). Cf. H. Bethe, Ann.
d. Phys. (5), 3, 133 (1929) ; E. Hückel, Zeits. f. Phys. 60, 423 (1930).

(4) 194. Appropriate methods for carrying through the perturbation
calculations (method of the “self-consistent field”) have been
developed by : D. R. Hartree, Proc. Cambr. Phil. Soc. 24, 89
(1928). J. A. Gaunt, Proc. Cambr. Phil. Soc. 24, 328 (1928).
Also J. C. Slater, Phys. Rev. 32, 339 (1928) ; 34, 1293 (1929) ; 36,
57 (1930). E. U. Condon, Phys. Rev. 36, 1121 (1930) ; E. U.
Condon and G. H. Shortley, Phys. Rev. 37, 1025 (1931). V.
Fock, Zeits. f. Phys. 61, 126 ; 62, 795 (1930). G. Breit, Phys.
Rev. 35, 569 ; 36, 383 (1930). W. Heitler and G. Rumer, Zeits.
f. Phys. 68, 12 (1931).

(5) 201. See the report by H. Hönl, Ann. d. Phys. (4) 79, 273 (1926).
For a derivation of the formulae on quantum mechanics, although
not from the group-theoretic standpoint, see M. Born, W. Heisen-
berg and P. Jordan, Zeits. f. Phys. 35, 557 (1926). Also in Chap.
IV of Born and Jordan, Elementare Quantenmechanik.

(6) 203. W. Pauli, Zeits. f. Phys. 43, 601 (1927).

(7) 203. G. E. Uhlenbeck and S. Goudsmit, Naturwiss. 13, 953
(1925) ; Nature 117, 264 (1926).

(8) 205. O. Richardson, Phys. Rev. 26, 248 (1908). A. Einstein and
W. J. de Haas, Verhandl. d. Deutsch. Phys. Ges. 17, 152 (1915) ;
18, 173 (1916). E. Beck, Ann. d. Phys. (4), 60, 109 (1919).
S. J. and L. J. H. Barnett, Phys. Rev. 17, 404 (1921). A. P.
Chattock and L. F. Bates, Phil. Trans. Roy. Soc. 223, 287 (1922).

(9) 207. A report on a unified notation for the designation of terms of
atomic spectra in terms of quantum numbers has been presented
by H. N. Russell, A. G. Shenstone and L. A. Turner, Phys.
Rev. 33, 900 (1929). It has also been found necessary to ascribe
a spin to the atomic nucleus in order to account for the hyper-fine
structure : E. Back and S. Goudsmit, Zeits. f. Phys. 43, 321 (1927) ;
47, 174 (1928) ; S. Goudsmit and R. F. Bacher, Phys. Rev. 34,
1501 (1929) ; S. Goudsmit, Phys. Rev. 37, 663 (1931). J.
Hargreaves, Proc. Roy. Soc. 124 (A), 568 (1929). E. Fermi,
Zeits. f. Phys. 60, 320 (1930). G. Breit, Phys. Rev. 37, 51 (1931).

(10) 209. E. Back and A. Landé, Zeemaneffekt und Multiplettstruk-
tur (Berlin 1925). A. Landé, Zeits. f. Phys. 15, 189 (1923). W.
Pauli, Zeits. f. Phys. 16, 155 ; 20, 371 (1923). A. Landé, Zeits.
f. Phys. 25, 46 (1924). W. Heisenberg and P. Jordan, Zeits. f.
Phys. 37, 263 (1926). K. Darwin, Proc. Roy. Soc. 118 (A), 264
(1928). For (jj) and (sl) coupling see J. H. Bartlett, Phys. Rev.
35, 229 (1930).

(11) 210. H. Weyl, Math. Zeits. 23, 292 (1925). J. v. Neumann 
and E. Wigner, Phys. Zeits. 30, 467 (1929). 

(12) 210. Proc. Roy. Soc. 117(A), 610; 118, 351 (1928). C. G. 
Darwin, Proc. Roy. Soc. 118 (A), 654 (1928). Landé, Zeits.
f. Phys. 48, 601 (1928) ; in the same volume F. Möglich, 852,
and J. v. Neumann, 868. V. Fock, Zeits. f. Phys. 55, 127 (1929).
For the older work concerning the interaction of spin and orbital 
moment of momentum see L. H. Thomas, Nature, 117, 514 (1926) ; 
J. Frenkel, Zeits. f. Phys. 37, 243 (1926) ; W. Heisenberg and 
P. Jordan in the same volume, 863. 

(13) 217. P. A. M. Dirac in Quantentheorie und Chemie, Leipziger
Vorträge, 1928, 83 (Leipsic 1928).

(14) 220. H. Weyl, Proc. Nat. Acad. Sci. 15, 323 (1929) ; Zeits. f. 
Phys. 56, 330 (1929). V. Fock, Zeits. f. Phys. 57, 261 (1929). 
V. Ambarcumian and D. Ivanenko, C. R. Acad. sc. USSR. 1930, 45.

(15) 224. See Wentzel's report cited in II (^*) ; A. Sommerfeld, Wave
Mechanics ; Born and Jordan, Elementare Quantenmechanik ; O.
Klein and Y. Nishina, Zeits. f. Phys. 52, 853 (1929). Y. Nishina,
same volume, 869.

(16) 237. A. Sommerfeld, Ann. d. Phys. (4) 51, 1 (1916). For the
significance of these results for the theory of X-ray spectra see
Sommerfeld's book cited in the Introduction. Perturbation
calculation in the new quantum mechanics, W. Heisenberg and
P. Jordan, l.c. (^®) ; exact derivation by means of the Dirac theory
of the electron ; W. Gordon, Zeits. f. Phys. 48, 11 (1928) ; C. G.
Darwin, l.c. (^*) ; A. Sommerfeld, Wave Mechanics, p. 267 ff.

(17) 241. W. Heisenberg, Zeits. f. Phys. 38, 411 (1926). Correspond-
ing energy calculation for He atom ; W. Heisenberg, Zeits. f.
Phys. 39, 499 (1926). P. A. M. Dirac, Proc. Roy. Soc. 112 (A),
661 (1926). J. A. Gaunt, Proc. Roy. Soc. 122 (A), 613 (1929) ;
Phil. Trans. Roy. Soc. 228 (A), 161. Y. Sugiura, Zeits. f. Phys. 44,
190 (1927). W. V. Houston, Phys. Rev. 33, 297 (1929). J. C.
Slater, Phys. Rev. 32, 349 (1928). G. Breit, Phys. Rev. 34,
553 (1929) ; 36, 383 (1930). The “symmetric” sub-space leads
to the Einstein-Bose statistics, which is discussed in the references
cited in II (•) above. The statistics arising from the “anti-sym-
metric” sub-space was developed by E. Fermi, Zeits. f. Phys.
36, 902 (1926) and applied by W. Pauli, Zeits. f. Phys. 41, 81
(1927), to the explanation of paramagnetism and by A. Sommerfeld
to the electron theory of metals : A. Sommerfeld, W. V. Houston
and C. Eckart, Zeits. f. Phys. 47, 1 (1928).

(18) 244. E. C. Stoner, Phil. Mag. (6) 48, 719 (1924). W. Pauli, Zeits.
f. Phys. 31, 765 (1925). It is to be remembered that this develop-
ment antedates the new quantum theory and the theory of the
spinning electron, and that Pauli's introduction of the four
quantum numbers n, l, j, m demanded a complete re-classification
of all spectroscopic material.

(19) 248. P. A. M. Dirac, Proc. Roy. Soc. 114 (A), 243 (1927). On
taking the interaction of the particles into account : P. Jordan 
and O. Klein, Zeits. f. Phys. 45, 751 (1927). 

(20) 250, 280. P. Jordan and E. Wigner, Zeits. f. Phys. 47, 631 
(1928). 

(21) 253. P. Jordan and W. Pauli, Zeits. f. Phys. 47, 151 (1928).
G. Mie, Ann. d. Phys. 85, 711 (1928). W. Heisenberg and
W. Pauli, Zeits. f. Phys. 56, 1 (1929) ; 59, 168 (1930) ; W. Heisen-
berg, Zeits. f. Phys. 65, 4 (1930) ; Ann. d. Phys. 9, 338 (1931).
L. Rosenfeld, Zeits. f. Phys. 63, 574 (1930). J. R. Oppenheimer,
Phys. Rev. 35, 461 (1930). G. Breit, l.c. E. Fermi, Rend.
Acc. d. Lincei (6) 9, 181 (1929). L. Landau and R. Peierls,
Zeits. f. Phys. 62, 188 (1930). L. Rosenfeld, Ann. d. Phys. (5)
5, 113 (1930).

(22) 257. H. Weyl, Journ. f. d. reine u. angew. Math. 141, 163 (1912).

(23) 261. See P. Jordan, Die Lichtquantenhypothese, in : Ergeb- 
nisse der exacten Wissenschaften, 7, 168 (1928). 

(24) 262. P. A. M. Dirac, Proc. Roy. Soc. 126 (A), 360 (1930) ; Proc. 
Cambr. Phil. Soc., 26, 361 (1930). J. R. Oppenheimer, Phys. 
Rev. 35, 939 (1930). For a report on this theory see P. A. M. Dirac, 
Nature, 126, 605 (1930). For an attempt to avoid the negative 
energy levels by a reduction of all operators see E. Schrödinger,
Sitzungsber. Preuss. Akad. 1931, 63.

(25) 264. See articles by Heisenberg-Pauli and Rosenfeld cited
in (21).

(26) 276. H. Weyl, Zeits. f. Phys. 46, 1 (1927). 

(27) 280. A rigorous proof of these theorems for ∞-dimensional space
has been announced by M. H. Stone, Proc. Nat. Acad. Sci. 16, 
172 (1930) ; J. v. Neumann informs me in a recent letter that he 
has also obtained a proof of this theorem. 


CHAPTER V 

(1) 284. The transition from the group 27o to the algebra i7, which is 

suggested by quantum mechanics, has also improved the theory 
from the purely mathematical standpoint ; see H. Weyl, Ann. 
of Math. (2) 30, 499 (1929). The connection between the repre-
sentations of πf and Cn was first clearly seen by I. Schur in
his Dissertation (Berlin 1901). Further see : H. Weyl, Math.
Zeits. 23, 271 (1925) ; I. Schur, Sitzungsber. Preuss. Akad. 1927,
58 ; 1928, 100. On the symmetry classes of tensors see : A. Young,
Proc. Lond. Math. Soc. 33, 97 (1900) ; 34, 361 (1901). H. Weyl,
Rend. Circ. Mat. Palermo, 48, 29 (1924).

(2) 287. This has been emphasized by P. A. M. Dirac, Proc. Roy.
Soc. 123 (A), 714 (1929).

(3) 291. G. Frobenius used the term “characteristic unit” for
this concept (see Sitzungsber. Preuss. Akad. 1903, 328), and this
name has been taken over into the physical literature. But in the
meantime the term “idempotent” has been used in systematic
investigations on algebras. The notions of “right- and left-invari-
ant sub-algebra” and “left-invariant sub-algebra” correspond
with those of “ideal” and “left-ideal” in arithmetic when all the
elements of the algebra are considered as “integers.”

(4) 303. E. Steinitz, Journ. f. d. reine u. angew. Math. 137, 167 (1910). 

(5) 307. Our proof of this theorem follows E. Noether, Math. Zeits.
30, 641 (1929).

(6) 313. In the older investigations T. Molien (Math. Ann. 41 and
42, 1893) and G. Frobenius operate in the field of all complex
numbers. The extension to arbitrary fields is due to J. H. M.
Wedderburn, and is also valid for algebras which are not com-
pletely reducible, a branch of the subject into which we have not
entered : J. H. M. Wedderburn, Proc. Lond. Math. Soc. (2) 6, 99
(1907) ; Bull. Am. Math. Soc. 31, 11 (1925). See also the book by
Dickson referred to in III (9). Our proof follows E. Noether, l.c.
(5). See further E. Artin, Abh. Math. Semin. Hamburg, 5, 251
(1927) ; G. Köthe, Math. Zeits. 32, 161 (1930).

(7) 320. E. Wigner, Zeits. f. Phys. 40, 492 and 883 (1926-27). W.
Heitler, Zeits. f. Phys. 46, 49 (1927). Only the simplest case,
that in which the unperturbed term of V consists of / different,
non-degenerate terms of the individual /, is considered in detail
in these papers.

(8) 328. This direct derivation follows H. Weyl, Math. Zeits. 23,
271 (1925).

(9) 338. See G. Frobenius, Sitzungsber. Preuss. Akad. 1898, 501. 

(10) 340. W. Heitler and F. London, Zeits. f. Phys. 44, 455 (1927).
W. Heitler, Zeits. f. Phys. 46, 47 (1927) ; F. London, in the same
volume, 455. W. Heitler, Gött. Nachr. 1927, 368 ; Zeits. f. Phys.
47, 835 (1928). F. London, Zeits. f. Phys. 50, 24 (1928). W.
Heitler, Zeits. f. Phys. 51, 805 (1928). M. Delbrück, Zeits. f.
Phys. 51, 181 (1928). F. London, in : Quantentheorie und
Chemie, Leipziger Vorträge 1928, 59 (Leipsic 1928) ; Zeits. f.
Phys. 63, 245 (1930). M. Born, Zeits. f. Phys. 64, 729 (1930).
J. C. Slater, Phys. Rev. 37, 481 ; 38, 1109 (1931). L. Pauling,
Journ. Am. Chem. Soc. 53, 1367 (1931).

(12) 342. The calculation is carried through in the first paper by
Heitler and London cited in (10). Further see : Y. Sugiura,
Zeits. f. Phys. 45, 484 (1927). S. C. Wang, Phys. Rev. 31, 579 
(1928) ; 28, 663 (1927). E. C. Kemble and C. Zener, Phys. Rev. 
33, 512 (1929). P. M. Morse and E. C. G. Stuckelberg, Phys. 
Rev. 33, 932 (1929). 

(13) 346. Zeits. f. Phys. 50, 24 (1928). 

(14) 347. Zeits. f. Phys. 49, 619 (1928) ; Sommerfeld-Festschrift :
Probleme der modernen Physik (Leipsic 1929).

(15) 357. P. A. M. Dirac, l.c. {^). For a detailed term calculation 
following this scheme and examples see papers by Slater, Condon, 
Condon-Shortley, Born-Rumer cited in (4) above. 

(16) 358. The introduction of the symmetry operators c into the
theory of invariants is due to A. Young, l.c. (1). But he proved the
irreducibility of neither nor ; that of the first was proved by
G. Frobenius, Sitzungsber. Preuss. Akad. 1903, 328, and that of
the latter by E. Cartan, Bull. Soc. Math. d. France, 41, 53 (1913)
and H. Weyl, l.c. (*). The symmetry classes were re-discovered in
quantum mechanics by F. Hund, Zeits. f. Phys. 43, 788 (1927).

(17) 362. The development from theorem (14.2) to (14.8) follows a 
train of thought communicated to the author in a letter from 
J. v. Neumann.

(18) 370. See F. Hund, l.c. (16) ; J. v. Neumann and E. Wigner, Zeits.
f. Phys. 47, 203 ; 49, 73 (1928). 

(19) 370. F. London, Zeits. f. Phys. 46, 455 (1928). 

(20) 372. W. Heitler, Zeits. f. Phys. 51, 805 (1928). 

(21) 378. Follows H. Weyl, l.c. (®). In the same way the character-
istics of the rotation group in n-dimensional space, the “complex
group” and all semi-simple groups can be calculated : Math.
Zeits. 24, 328, 377 and 789 (1926).

(22) 382. L.c. (®). On removing the unitary restriction, the proof
that we here obtain all irreducible representations of requires
the use of the infinitesimal elements of the group. The knowledge
won for Un has been carried over to Cn under the broadest assump-
tions by J. v. Neumann, Sitzungsber. der Preuss. Akad. 1927, 26 ;
Math. Zeits. 30, 3 (1929) ; and I. Schur, Sitzungsber. Preuss. Akad.
1928, 100.

(23) 383. Sitzungsber. Preuss. Akad. 1900, 516.


OPERATIONAL SYMBOLS


The number refers to the page on which the symbol is defined 

-> with ... is associated ... 110, 114. 

-3 is contained in 290. 

conjugate complex of x 15. 

transposition : for operators 13, symmetry quantities 352, 
symmetry patterns 361. 

Hermitian conjugate : for operators 17, elements of an 
algebra 167. 

^ a[s) " ci{s~'^) 296. 

o contragredient matrix 123, representation 123. 

~ equivalent as correspondences of the ray field 21. 

~ transforms as 145. 

( ) scalar product 16, 32. 

[ ] vector product (in 3-dimensional space) 27 ; commutator 
[HA] - -^HA - AH) 264. 

< ) temporal mean value 88. 

X for vectors 90, vector spaces 90, correspondences 91, 
representations 126, groups 127, algebras and their 
elements 333. 

X multiplication of representations of two groups 127. 

addition of representations 113. 
fjl transition from p to ^ 287. 

D transition from ^ to I) 290. 
LETTERS HAVING A FIXED SIGNIFICANCE

The number refers to the page on which the quantity is defined 

LATIN 

c velocity of light ; a Young symmetry operator 359. 
e primitive idempotent element (generating unit 291) ; — e- 
charge of the electron. 
e{x) — e'^. 

(Ex, Ey, Ez) = 𝔈 electric field strength 99.

Ei energy level 44. 

/ number of electrons, order of a tensor 139, 281. 

4-vector potential multiplied by efih 214. 

f., curl off. (= 

F action of the electro-magnetic field 216. 

F(ij, 12 . • • •. if) tensor 139, 281. 

g dimensionality of a group representation 120, Lande g- 
factor 204, 207. 

h Planck’s quantum of action divided by 2π 51, order of a
finite group 118. 

H energy 51. 

(Hx, Hy, Hz) = ℌ magnetic field strength 99.

I signature 188. 

j, J inner quantum number 189, 190. 

Jix total energy-momentum vector 220. 
k auxiliary quantum number 228. 

l, L azimuthal quantum number 64, 185, 194 ; for s, p, d, f, g,
. . . terms l = 0, 1, 2, 3, 4, . . . .

(Lx, Ly, Lz) = 𝔏 orbital moment of momentum 63.

m magnetic quantum number 64, 193, multiplicity of a re-
presentation 321, 350 ; (= μ) mass of the electron.

m0 = mc/h.

M, M' action of the material field 211.

(Mx, My, Mz) = 𝔐 total moment of momentum 179, 187.
n dimensionality of a vector space 1 ; principal quantum 
number 69. 

p, q canonically conjugate variables 94, a permutation in the 
rows, columns of a symmetry pattern 369. 

(px, py, pz) = 𝔭 linear momentum of a particle 51.

P symmetry pattern 368.

(qx, qy, qz) = 𝔮 electric dipole moment 83.

r distance from centre.

s element of a group ; spin quantum number 206.

(sx, sy, sz) = 𝔰 electric current density 218, sα charge-current
4-vector 214.

(Sx, Sy, Sz) = 𝔖 spin 178, 203.


1 0 


0 1 

II 

0? 

0 

— I 

,53 = 

1 0 

0 1 


1 0 


i 

0 


0 - 1 


energy-momentum tensor 218. 

T interchange of ifi^, tpi, 1^9. 

V valence 369. 

W perturbation energy 86, total action 216. 

x0 x1 x2 x3 or t x y z co-ordinates of space time (t = x0 98, or ct = x0
211).


GERMAN. (For 3-dimensional vectors see their components 
under Latin letters.) 

C = C„ group of (unimodular) linear transformations in n dimen- 
sions 128. 

(c)^ representation of c whose substratum is the tensors of 
order / 125. 

= ‘^j{v — 2j) representation of yth degree of C2 or Uj ~ bs 
128, 142. 

b„ orthogonal group in n dimensions 142 ; b’„ same but in- 
cluding improper rotations 143. 

^(wi) 1-dimensional representation of rotation group bj 141. 

Cl, 62, . . ., e„ co-ordinate system in vector space 2. 

@ unitary representation of the rotation group induced in 
the function space of tp{x y z) 143. 

g abstract group 114. 

la conjugation 118. 

m mean value 168. 

5R representation of the rotation group induced in system 
space 187. 

j), ^ invariant sub-space of r, W respectively 287, 282. 

n 

r an algebra considered as a vector space 286, to = t = Ij 9?-^ 
290, 350. 




91 vector space |, 91^ corresponding space of tensors of order /, 
[9l'^] space of the symmetric tensors, (9I'^} space of the 
anti-symmetric tensors, 239, 242. 

9Io system space of electron translation, spin 196. 
ta left- translation 116. 

U = U„ (unimodular) unitary group in n dimensions 139. 

SS ray representation giving rise to algebra of complex 
quaternions 182. 

5 vector in n dimensional vector space 1. 


GREEK 

α = e²/ch fine structure constant 216.

δik Kronecker symbol = 1 or 0 according as i = k or i ≠ k 17.

δ(x) Dirac δ-function (= 0 except for x = 0 and ∫ δ(x)dx = 1, the
integral extending from −∞ to +∞) 255.

δs = ± 1 according as s is an even or an odd permutation 121.

δ signature 201.

Δ operator ∂²/∂x² + ∂²/∂y² + ∂²/∂z² 52.

ε generating element of a right- and left-invariant sub-space
311.

θ, φ polar co-ordinates 60.

μ (= m) mass of the electron.

ν frequency 50.

o = Larmor factor = unit of Zeeman separation.

π = πf symmetric group of permutations of f objects 121.

ρ electric charge density 218, an algebra 304.

φα electro-magnetic 4-vector potential 98.

ψ vector defining the state of the material field 49.

χ, X group characteristics 150, 151.

ω angle of rotation 151.



INDEX 


The numbers refer to pages of the text, those in boldface to the pages where the
concepts introduced in boldface are defined


Abelian group 118 , its unitary irreduc- 
ible representations 140, in ray space 
182, quantum kinematics as A. g. 
of rotations 272 ff. A. system of 
forms 25. 

Absorption of photon 44, quantum 
theory of a. 107, 224, 261, a. lines 45. 

Action of material field 211, of electro-
magnetic field 215, total 216, 222.

Adaptation of co-ordinate system to
sub-space 3.

Addition of vectors 1, of correspond-
ences 6, of matrices 7, of repre-
sentations 126, of elements of an
algebra 165, 303, of numbers of a
field 302, direct sum of algebras 311.

Affine correspondence 5, see Corre-
spondence, linear ; a. geometry 1 ff.,
112.

Algebra, general concept 303, of group
166, 181, 286, simple 311, 313,
semi-simple 316, order of a. 304,
modulus or principal unit 168, 304,
basal units 168, 304, division a.
(= field) 304, 316, central of a. 167,
311, invariant sub-a. 167, 280,
generating unit of s.-a. 168, 291,
direct sum 311, direct product 333,
reduction into simple matric a. 167,
309 ff., 315 ; representation of a. 166,
304 ff., regular representation 289,
complete reduction of representation
306 ; a. of complex quaternions 182,
of linear transformations 307, of
symmetric transformations 282, 332,
its enveloping a. 284, reduction of
a. of linear transformations 307 ff.

Alkali spectrum 85, 86, 202, doublets
in 204, with anomalous Zeeman
effect 205.

Alkaline earth spectrum 207, 246. 

Alternation 358. 


Alternation law 207, 370. 

Atom, Rutherford’s model xiii, Bohr’s 
theory of a. 43, radiation on classical 
and Bohr theories 44, on quantum 
theory 104 ff., 256 ff., Hund’s vector 
model of a. 191, 244 ; see Spectrum. 

Automorphism 115 , automorphic corre- 
spondence of group 134 , 

Auxiliary quantum number, see under 
Quantum number. 

Azimuthal quantum number, see under 
Quantum number. 

Balmer 45. 

Bessel’s inequality 33, for system of 
representations 169. 

Black body radiation 41, 104, 256. 

Bohr, H. 39. 

Bohr magneton 66, 205. 

Bohr, N. xiii, 43, 95, 105, 236, 245. 

Boltzmann 108.

Born 48, 74. 

Bose 50. 

Bounded Hermitian form 39. 

Brackett 46. 

Branching rule, for spectra 207, for 
linear and permutation groups 390 ff. 

de Broglie, L. 48, 53, 211, 220.

Burnside’s theorem 153. 

Canonical variable 52, 94, c. trans-
formation 96, in quantum mechanics
98, c. aggregate 79, c. basis for
rotations in ray space 274.

Central, of group 118, of algebra 167,
313.

Character, group or group character-
istic 150, 327, 395, of unitary re-
presentation 156, primitive c. 150,
150, behaviour on addition and
multiplication 151, orthogonality pro-
perties 156, 159 ff., 317. For char-
acters of special groups see under
qualifying adjective.

Character of element of algebra 295. 

Characteristic number of Hermitian form 
or operator 21, 35, of unitary form 26, 
multiplicity of c. n. 22, 26, of energy 
56, 80 ; — characteristic vector or func- 
tion 21, 35, of wave equation 56, 80 ; 

— c. space 22, of energy 80, 192, of 
moment of momentum 189, 192. 

Class of conjugate elements 118, in 
symmetric permutation group 328 ; 

— c. function 150, 156, as element in 
central of group algebra 169. 

Classical mechanics compared with 
quantum mechanics xiii, 73, 81, 94, 
190, “ c.” combination principle 47, 

82. 

Clebsch-Gordan series 128, 163, 190, 
371, as quantum rule for composi- 
tion of moment of momentum 190, 
as valence rule 371. 

Closed shell 86, 245. 

Cogredient transformation 5. 

Collision phenomena 46, 70 ff. 

Combination principle, Ritz-Rydberg 
44, 48, 82, “ classical ” 47, 82. 

Commutation rules, Heisenberg’s 94 , 
274, interpretation of 275, wave 
equation derived from c. r. 277 ff., 
c. r. for infinitesimal rotations 178, 
for moment of momentum 179, for 
spin 227, in second quantization 249, 
for Maxwell-Dirac equations 254 ff. 

Commutative field 302 , c. group 118 , 
c. operators transformed simultane- 
ously to principal axes 25. 

Commutator 177, 264, 267.

Commutator form 273. 

Completeness of unitary-orthogonal sys- 
tem of functions 3 , of spherical 
harmonics 62, on group manifold 
170, c. of system of unitary repre- 
sentations 140, 159, 170, 305, 318, 
of product representation 164; — com- 
plete system of orthogonal vectors in 
3-space 257. 

Complete reduction of correspondences 
or representation 9 , 122 , sometimes 
equivalent to reduction 18, 123, 136, 


292. 301, 306, 308, of product re- 
presentation 140, of {Sf X 128, 
190, uniqueness 136, 156, c. r. of 
system space with respect to energy
80, of representation induced in 
system space by bs 188, of group 
space 294, of tensor space 301, of 
an algebra into simple matric algebras 
167, 309 ff., 315. 

Composition of physical systems 91, 
behaviour of energy on c. 92, 193, of 
moment of momentum 190, c. of 
equivalent individuals 239, 241, under
Pauli exclusion principle 244, method 
of c. compared with second quantiza- 
tion 248 ; — c. of transformations 6, 
110, see Multiplication.

Composition series, of sub-groups 132 , 
of sub-spaces 122, 135. 

Compton effect 224. 

Condon 74. 

Congruent modulo sub-space 4. 

Conjugate of element of group 118 , 
for permutation group 328, of ele- 
ment of algebra 167. 

Conjugation 118. 

Conservation law, for electricity 214 ff.,
energy 82, 218, 220, momentum 218,
220, moment of momentum 188,
221, Dirac’s c. l. 227, of quantum
field 264 ff.

Contact transformations 96 . 

Contragredient transformation 12 , re- 
presentation 123 . 

Contravariant vector 13. 

Convex region 79. 

Co-ordinate system, in vector space 2 , 
adapted to sub-space 3, transforma- 
tion of c. s. 4, normal c. s. 16 , 
21, Heisenberg’s c. s. 80 , in special 
relativity 147, in general relativity 
219. 

Correspondence or transformation,
general 110, identical 110, inverse
111, product 111, isomorphic 112,
automorphic 134, similarity 283 ;
linear 6 ff., 21, = projection 282,
in function space 35, trace 11, KO,
dual 13, 123, contragredient 12,
Hermitian 18, unitary 16, infinites-
imal unitary 28 ff., rotation of ray
space 20, ×-multiplication 90, re-
duction and complete reduction 9,
irreducible system of l. c. 122, 153 ff.,
symmetric c. in tensor space 282.
For special groups of correspondences
see under qualifying adjective.





Correspondence principle 95. 

Coupling, Russell-Saunders or (sl) 206,
(jj) 206.

Courant 40.

Covariant linear quantity 173, in
quantum mechanics 197 ; c. vector
13.

Cycle of a permutation 328.

Cyclic group 117.


Davisson 50, 53, 70. 

Decomposition, see Complete reduction, 
of space 3, 122, of dual space 14, 
in unitary geometry 18, into char- 
acteristic spaces, 22. 

Degenerate system 83, perturbation 
of 86, accidental degeneracy 192. 

Degree of a representation 120. 

8-function 36, 255. 

Derivative of operator 94. 

Dimensionality of space 2, 3, of a
representation 120.

Dirac 109, 210, 211, 217, 225, 255, 260,
262, 357.

Dirac’s relativistically invariant equa-
tions for electron 213, 218, 225, in
central field 227 ff., quantization of
253 ff. ; D. theory of proton 262.

Directional quantization 67, 75, 205.

Dispersion 53, 224.

Division algebra (= field) 304, 316.

Double tensor 347.

Dual space 12, matrix 13, system of
transformations 123, symmetry ele-
ment and representation 352, sym-
metry pattern 361, 369.

Dynamical variable, represented by
Hermitian form 74, 275, measure-
ment of 74 ff., mean value or ex-
pectation 75, intensity on transition
83, 197, composition 91, totality of
d.v. represented by irreducible system
238 ; d. law 54, 80 ff., 97, 187, 266.

Dynamically independent systems 92.

Effective quantum number, see under
Quantum number.

Einstein 42, 50. 

Electric charge, atomicity of 216, posi-
tive and negative 262, e. c. density
and current density 215, conserva-
tion of e. c. 214, 217, e. dipole moment
83, 104, 197.

Electro-magnetic field, effect on charged
particle 98, 213, 222, interaction with
matter 105, 261, equations of 102,
218, quantization 104, 253, action
215.

Electron, de Broglie’s equation for e.
53, Schrödinger’s 54, iii, Dirac’s
213, e. beams 50, spin 195, 196,
203, 276, translation 196, in spher-
ically symmetric field 63, 227, nega-
tive energy levels and “positive e.”
225, existence vs. constitution of e.
261, e. and proton 262.

Element, of group 114 , of group alge- 
bra 166 , of algebra 303 , idem- 
potent e. 168, 291 , independent 292 , 
primitive 293 , real 295, trace 299 , 
317 , scalar product 299 , character 
of an e. 295. 

Elsasser 74. 

Emission, of photon 44, quantum 
theory of e. and absorption 107, 224, 
261, spontaneous 107, stimulated 
108. 

Energy, and its operator 51 ff., 80 ff.,
97, 187, 215, e. level 44, 50,
collision phenomena 70, in perturba-
tion theory 86 ff., on composition 92,
in electro-magnetic field 101, with
spin 215, 220, e. of radiation field
103, 258, e. of simple state 189, loi,
of system of equivalent individuals
320 ff., 356, of molecule 346, ex-
change e. 322, 342, 346, e. and
momentum 51, 218, 220, conserva-
tion 188, zero-point e. 104, 258, 261,
inertia of e. 221, e. quantum 41.

Enveloping algebra 284, for double ten-
sors 348.

Equality, axioms of 112.

Equivalence degeneracy 239 ff., 320. 

Equivalent individuals, state of system 
consisting of e. i. 239 ff., energy 241, 
320 ff., 356, quantization 246. 

Equivalent systems of linear transforma-
tions 121, e. representations 120,
sub-spaces 135, 283, e. points with
respect to transformation 112, e.
elements with respect to sub-group
118.

Euclidean geometry 15, 112. 

Exchange energy 322, 342, 346. 

Expectation or mean value of physical 
quantity 75, 78, 92. 

Exponential function 28, of matrix 29.

Factor group 119, 132.

Faithful realization 114 . 

Ferro-magnetism 347. 

Field equations, for electro-magnetic 
field 102, 218, for matter 213 ff., 
their quantization 104 ff., 253 ff. 

Field, number f. 294, 302 , algebraically 
closed 294 , commutative 302 , finite 
f. of modulus p 303 ; — ray f. 20 , 
vector f. 20, point-f. 1 10. 

Fine structure, in hydrogen 203, 236,
f. s. constant 216.

Form, linear 12, bi-linear 13, 16, 18,
Hermitian 18, unitary 16, commu-
tator 273, anti-symmetric bi-linear
273, 397.

Fourier coefficient 33, series 33, in- 
tegral 39, F. c. or group matrix for 
representation 165. 

Franck 46, 70, 74. 

Frequency 50, Bohr’s f. rule 47, 105, 
109. 

Frobenius 156, 358, 383. 

Function space 32, of quadratically 
integrable functions 143. 


Galois, 132. 

Γ-process 126.

Gamow 74. 

Gauge invariance 100 , 213, 220, rela- 
tion to conservation of electricity 214, 
217, role in quantization 256, 271. 

Generating function of infinitesimal 
canonical transformation 97, 

Generating unit 291 , independent 292 , 
in field of complex numbers 295, of 
symmetry class of tensors 296. 

Geometry, affine or vector 1 ff., 112,
Euclidean 15, 112, unitary 15 ff.,
characterized by group 112.

Gerlach 65, 75. 

Germer 50, 53, 70. 

g-factor, Landé, 204, 205, 207.

Goudsmit 203. 

Group 110 ff., transformation g. 111,
abstract 114 ff., isomorphic 115,
automorphic correspondence of g.
115, 134, commutative or Abelian
118, cyclic 117, order of finite g.
118, of element of g. 117, central
118, sub-g. 116, index of sub-g.
118, self-conjugate or invariant sub-g.
119, 132, factor g. 119, simple 132,
direct product 127, closed continuous
160 ff., Lie theory of continuous g.
175 ff., g. manifold 160 ff., invariant
sub-space of g. manifold 291; — realiz-
ation of g. 114, representation of g.
120, of sub-g. 127, 334, of direct
product 333, g. matrix 165, algebra
of g. 166, 181, 286. For special
groups, see under qualifying adjectives.

Gurney 74. 

Gyro-magnetic effect 205. 


Hallwachs 42. 

Hamilton 50, 138. 

Hamiltonian equations, in classical 
mechanics 96, 98, in quantum mech- 
anics 94, in quantum field theory 253. 

Heisenberg xiii, 48, 80, 82, 222, 264,
347.

Heisenberg’s co-ordinate system 80 . 

Heisenberg-Pauli theory of the quantum
field 253 ff. 

Heitler 342. 

Hellinger 39, 40. 

Hermite 18. 

Hermitian form or operator 18, non-
degenerate 18, positive definite 18,
unit 15, idempotent 23, in function
space 35, 37, bounded 39, product
of H. f. 20, trace 20, characteristic
number 21, 35, transformation to
principal axes for single H. f. 21 ff.,
32, for Abelian system 25; — H. f.
represents physical quantity 74, 275,
characterizes statistical aggregate
79, 239; — H. conjugate 17.

Hermitian polynomials 57 ff. 

Hertz, G. 46, 70, 74. 

Hertz, H. 42. 

Hilbert 39. 

Hilbert space 32. 

Hund’s vector model of the atom 191,
244.

Hydrogen atom 45, on Schrodinger’s 
theory 63 ff., on Dirac’s theory 234 ff., 
spectrum 45, 69, fine structure 203, 
236.

Idempotent Hermitian form 23, 37,
independent 25; — i. element of an
algebra 168, 291, independent 292,
primitive 293.

Identity correspondence 6, 110, repre-
sentation 121.

Independent, linearly i. vectors 2, i. 
idempotent forms 23, idempotent 
elements of algebra 292 . 

Index of sub-group 118 . 

Infinitesimal unitary transformation 28 ff., 
rotation 27 ff., moment of momentum 
induced by i. r. 178, canonical trans- 
formation 96, element of continuous 
group 160, 177. 

Inner quantum number, see under 
Quantum number. 

Intensity, as measure of probability 49, 
i. of dynamical variable on transition 
83 , 197, of spectral lines 44, 83, 232, 
in anomalous Zeeman effect 201. 

Interaction between matter and radia- 
tion 104 ff., 261. 

Interchange, of right and left 225, of
past and future 109, 227, 263.

Invariance, in special relativity, dif- 
ficulty for quantum mechanics 54, 
Dirac’s treatment 210 ff., i. of 
quantum field equations 268 ff. ; — in 
sense of general relativity 219, under 
change of gauge 100 , see Gauge 
invariance. 

Invariant of transformation group 117,
170, in representation space 171,
classical theory 170 ff.

Invariant sub-space 8, under system
of transformations 122, 135, 282,
left-i. s.-s. in group space 289 ff., left-
and right-i. s.-s. 168, 311, in tensor
space 296 ff., significance in quantum
theory 320; — i. sub-group 119,
maximal 132.

Inverse correspondence 6, 111, element
of group 114.

Involution 13. 

Ionization potential 46. 

Irreducible invariant sub-space 122 , 282, 
system of linear transformations, re- 
presentation 122, reduction into i. 
constituents 122, 135 ; — irreducibility 
= complete irreducibility in unitary 
domain 136, 292, 301, for reducible 
algebra 305, for algebra of trans-
formations in completely reducible
vector space 307.

Isomorphic correspondences 112,
simply isomorphic groups 115.

Jeans 42, 102, 103.

(jj) coupling 206. 

Jordan-Holder theorem 131 ff. 

Jordan, P. 261, 280. 

Kinematically independent systems 92, 
190, perturbation of 93. 

Kinematics of system determines repre- 
sentation in system space 189, 
Heisenberg’s quantum k. 94 ff., as 
Abelian group of rotations 272 ff., 
in second quantization 250, k. of 
spin 195, 203, 276. 

Klein’s Erlanger programme xv, 112. 

Laguerre polynomials 70. 

Landé 204, 208.

Laporte’s rule 201, 203. 

Legendre polynomials and associated 
functions 62, with spin 230. 

Lenard 42. 

Leonardo da Vinci 112.

Lie 176. 

Light, wave and corpuscular nature of
48 ff., 53.

Linear, l. algebra 303, see Algebra; —
l. correspondence 5, see under Corre-
spondence; — l. form 12, l. covariant
quantity 173, l. projection = l. cor-
respondence 282, l. sub-space 2; —
l. momentum, see Momentum, linear.

Linear group, complete c_n 123, simplest
representations 123 ff., representa-
tion C_f of c_2 128 ff., its ir-
reducibility 299, representation C_{f,g}
131, 164; — reduction of (c)^f equivalent
to reduction of algebra of symmetric
transformations 284 ff., unitary re-
striction immaterial 285, result of
the reduction 301, characteristics
335 ff., relation to characters of
symmetric permutation group 326; —
representations of order f 309,
branching law 391.

London 342, 346, 370. 

Lorentz group, restricted, obtained from
c_2 147 ff., complete L. g. obtained on
adding reflection 147, positive and
negative transformations 147, and
Dirac’s equations 212 ff., transforma-
tion induced in system space 268 ff.

Lyman 45.


Magnetic quantum number, see under 
Quantum number. 

Magneto-mechanical anomaly 205. 

Magneton, Bohr 66, 205. 

Magnitude, absolute, of vector 16, 19. 

Mapping 110, see Correspondence,
Transformation.

Matric algebra, simple 168, 313. 

Matrix 7, dual or transposed 13, unit
6, addition 7, multiplication 8, re-
duced and completely reduced 9,
transformation of m. 8, norm 11,
trace 11; — group m. 165.

Maxwell’s equations 102, 218, quan-
tization of 104 ff., 253, M. action 215.

Mean value or expectation of physical 
quantity in pure state 75, 78, 92, in 
mixed case 79; — m. v. over group 
manifold 158. 

Measurement of dynamical variable 74 ff.

Metric 15. 

Millikan 42, 245. 

Minkowski, H. 79. 

Mixed state 79. 

Modulus, of algebra 168, 304, reduc-
tion of 168, 301; — of finite field 303.

Molecule, spectrum 191, perturbation
theory and constitution 339 ff., non- 
polar bond 342, London formula 
for binding energy 346, on taking 
account of Coulomb forces 356, val- 
ence theory 369 ff. 

Moment of momentum of a representa- 
tion 179 , of 179 ; — m. of m. of phy- 
sical system 187 , orbital 64, 195 , 
spin 195 , 203, 218, behaviour on 
composition 190, conservation 188, 
219 ff., 227, reduction of system 
space with respect to m. of m. 192, 
induced by infinitesimal rotations of 
Lorentz transformations 185, 269. 

Momentum, linear, and its operator 51, 
220, conservation of energy and m. 
218, 264 ff. 

Moseley’s law 69. 

Motions, geometrical 111, group of 176.

Multiplet 196, 206, 373, as relativis-
tic phenomenon 204, 234, normal
Zeeman effect 101, 193, 198, anom-
alous Zeeman effect 204, 208 ff.,
alkali doublets 204, singlets and
triplets in alkaline earths 207, 246,
multiplicity 321, 350, under Pauli
exclusion principle 352, in 2-dimen-
sional spin 355, 369, multiplicity and
valence 369 ff., branching rule and
alternation law 207, 370.

Multiplication, of vector by number 1,
of correspondences and matrices 6 ff.,
of numbers of field 302, of elements
of algebra 165, 303, quaternion m.
138, outer or ×-m. of spaces, vectors,
operators 90, 125, of representations
126, direct product of groups 127,
333, of algebras 333, ×-m. of repre-
sentations 127; — scalar m. of vectors
16, of elements of an algebra 299, 317.

v. Neumann 40, 78.

Noether, E. 134. 

Normal co-ordinate system 16 , in rel- 
ativity 147, n. state of atom 45, 
n. term order 206. 

Number, of field 302, operations on 302;
— characteristic n. 21.

Operator = linear correspondence 6,
Hermitian 18, in function space 35,
representing dynamical variable 55,
considered as function of time 81,
derivative of o. 94.

Orbit, in older quantum theory 47,
orbital moment of momentum 64,
195.

Order, of finite group 118, of element
of group 117, of sub-group 118,
of finite algebra 303.

Orthogonal group, see Rotation group;
— o. transformation 16, o. vectors 16.

Orthogonality relations 32, for group
characters 159 ff., 317, for sym-
metric permutation group 367.

Oscillator 43, 56 ff., 84, black body 
radiation as system of o. 102 ff., 258, 
quantum mechanical laws of system 
of o. 249. 

Parseval’s equation 33, 35, 162. 

Paschen 45, 236. 

Paschen-Back effect 208. 

Pattern, symmetry, see Symmetry
pattern. 

Pauli 77, 203, 211, 244, 264, 347, 351.

Pauli exclusion principle 207, 244 ff.,
and reduction of algebra of sym-
metric transformations 281, 323, 347 ff.,
355, 370 ff.

Peirce reduction 312.

Periodic system of the elements 69, 
242 ff. 

Permutation 1 1 , reduction into cycles 
328, conjugate 328, as operator on 
tensor 281. 

Permutation group, symmetric 121,
classes 328, elements as symmetry
operators 286, relation to symmetry
class of tensors 286 ff., for arbitrary
p. g. 332, characters 320, 383 ff.,
relation to characteristics of unitary
group 331, use of characters to
calculate exchange energies 322 ff.,
energy of non-polar bond 346, ex-
plicit theory of representations 358 ff.,
reciprocity theorems 339, branching
law 390.

Perturbation theory 86 ff., for kine-
matically independent systems 93,
for equivalent individuals 321 ff.,
for molecules 339 ff.; — p. energy 86,
for axially symmetric field 192, for
magnetic field 101, 193, 204, 224,
for electric field 101, 224, spin p. 196,
in Dirac theory 224, determines
transition probability 89.

Pfund 46. 

Photo-electric effect 42. 

Photon 42, 49, 54, 104, 248, 258, 261. 

Planck xiii, 41. 

Planck’s radiation law 41, 108. 

Point-field 110.

Polynomial, characteristic 11, 22; — Her-
mitian 57 ff., Legendre 62, with
spin 230, Laguerre 70.

Primitive unit 293 , character 150, 
symmetry class 358. 

Principal unit of algebra 168, 304;
— p. transformation 128, transforma-
tion of Hermitian forms to p. axes 21,
25, 32, 39, for unitary forms 26, 39;
— p. quantum number, see under
Quantum number.

Probability, relation to intensity 49,
that a dynamical variable assume a
given value in a pure state 75, in a
mixed state 79, p. density and current
density 50, 215, 217; — transition p. 73,
83, 89, in composite system 90, 93,
for an atom in radiation field 106 ff.

Product, see Multiplication. 

Projection, with respect to sub-space 4,
in unitary geometry 18, orthogonal
and unitary-orthogonal 23, linear
p. = linear correspondence 282.

Proton, Dirac’s theory of 262. 

Pure state 75 , conditions for 77. 

Quantization, in the older quantum
theory 47, in Schrodinger’s theory
51, 56, in Heisenberg’s 93 ff., of
composite system 89, of electro-
magnetic field 104, 253, second 246,
of Maxwell-Dirac field equations
253 ff.; — directional or space q. 67, 75,
205.

Quantum, of action 41, 51, of energy 41.

Quantum kinematics, Heisenberg’s 94 ff., 
as Abelian group of rotations 272 ff., 
in second quantization 250. 

Quantum mechanics, general scheme 
74 ff., dynamical law 54, 80, 97, 187, 
266, composition 91, Heisenberg’s 
formulation 93, Schrodinger’s equa- 
tion 54, 101, Dirac’s equations 213, 
218, Heisenberg-Pauli q. m. of wave
fields 253 ff. 

Quantum number, auxiliary (k) 228,
selection rules 233, relation to azi-
muthal and inner q. n. 228, 233; —
azimuthal q. n. (l, L) 64 ff., 142, 196,
determines orbital moment of mo-
mentum 65, 196, selection rules 84,
201, on composition 194, 207, 373,
relation to auxiliary q. n. 228, 233;
— inner q. n. (j, J) 109, 196, deter-
mines total moment of momentum
179, 189, behaviour on composi-
tion 190, 194, 206, selection rules
198, relation to auxiliary q. n. 228, 233;
— magnetic (m) 64, 193, determines
z-component of moment of momentum
65, 180, 189, selection rules 85, 198,
of spin and of orbital moment of
momentum 209, in Dirac’s theory
232; — principal or total (n) in hydro-
gen 69, in hydrogen-like spectra 85,
has no group-theoretic significance
144, true 86, 243, effective 243; —
radial 64, 144; — spin (s) 206, re-
lation to valence 369.

Quantum state 43, 56, 80, 188, simple
189.

Quaternion 138, complex 182. 

Radial quantum number, see under 
Quantum number. 

Radiation, from atom 44, 83 ff., 105 ff., 
224, field 102 ff., 215, 256 ff., black 
body 41, 104. 





Ray 4, 20, represents state of physical
system 75, r. field 20, rotations of
r. field 273, r. representation 181 ff.

Rayleigh 42. 

Real element of algebra 167, gener- 
ating unit 295. 

Realization of group 114, faithful 114,
contracted 118, 119, of algebra 166;
— linear r. = representation 120, see
Representation.

Reciprocity theorem, for arbitrary group 
338, for permutation group 339. 

Reduction of correspondences or re-
presentation 9, 122, uniqueness 136,
156, complete r. 9, 122, 135 (see
Complete reduction), sometimes im-
plies complete r. 18, 123, 136, 292, 301,
306, 308, of regular representation
289 ff., 305 ff., of system space of
equivalent individuals 238 ff., anti-
symmetric r. for electrons 242, 351 ff.,
symmetric r. for photons 248, 351 ff.,
influence on term spectrum 241, 372 ff.,
general treatment without spin 296 ff.,
with spin 347 ff., for symmetric and
anti-symmetric cases 351 ff.

Reflection, signature induced by r. 143, 
146, 188. 

Regular representation 289, reduction
305 ff.

Relativity theory, special 51, 98 ff., 146 ff.,
of quantum mechanics 210 ff., of
wave fields 268 ff., r. and spin 204, 217,
222 ff.; — general 219.

Representation, of finite group 120,
of continuous group 160 ff., by ro-
tations of ray space 181, degree or
dimensionality 120, character 150,
complete reduction 122, irreducible
122, uniqueness of reduction 136,
156, criterion for irreducibility 159,
identical 121, equivalent 121, unit-
ary 136 ff., any r. equivalent to unitary
r. 157; — formal processes: addition
126, ×-multiplication 126, 127, ×-
multiplication 127, Γ-process 126,
r. of sub-group 127; — of algebra 166,
304 ff., regular 289; — general theory:
orthogonality properties 157 ff., 317,
in terms of group algebra 165 ff.,
completeness of system of r. 159, 170,
318, proved by reduction of regular
r. 305 ff. For r. of special groups,
see under qualifying adjective.

Resonance, between states of same energy
87, between equivalent individuals
239 ff., 320.


Resonance line 45. 

Ritz-Rydberg combination principle 44,
48, 82. 

Rontgen 43. 

Rotation group, in 2-space and its re-
presentations 140 ff., orthogonality
of characters 162; — in 3-space
and its representations 142 ff., rela-
tion to unitary group in 2-space 144,
augmentation by improper rotations
143, orthogonality of characteristics
163, completeness 143, 163, 180, 184,
380, generated by infinitesimal ele-
ments 175, representation induced in
system space 185, 195, 372; — in
n-space 184.

Rotation in ray space 21, 181, 273,
representation by r. of ray field 180, 
quantum kinematics as Abelian group 
of r. 272 ff. 

Rupp 50. 

Russell-Saunders coupling 206. 

Rutherford xiii, 74.

Rydberg number xiii, 45, 69. 


Scalar product, see Multiplication. 

Scalar quantity, commutes with moment 
of momentum and signature 188, 
selection rules 197. 

Schrodinger 48, 50, 56, 102, 187, 216, 
220, 258. 

Schrodinger’s equation 54 ff., relativ-
istic 101, for system of equivalent
particles 194, as limiting case of
Dirac’s 234, derived from com-
mutation rules 277 ff.

Schur, I. 152. 

Schwarz' inequality 30, 393. 

Second quantization 246, see under 
Quantization. 

Secular equation 11, 21, 26, in quantum
theory te, 209, 344. 

Selection rules 44, 84, 85, for oscillator 
84, for electron without spin 84 ff., 
with spin 232, for scalar quantity 197, 
for vector quantity 197, for auxiliary 
quantum number 233, azimuthal 84, 
201, inner 198, magnetic 85, 198, 
for signature 201. 

Self-conjugate sub-group 119, maximal
132.

Semi-simple algebra 316.

Separation of terms by perturbation 87,
321, axially symmetric perturbation
193, in normal Zeeman effect 101, 193,
198, in anomalous Zeeman effect 204,
208 ff.

Series, in hydrogen 45, 69, in alkalies 
85, 202. 

Series of composition, see Composition 
series. 

Signature, of representation 143 , as 
dynamical variable 188 , 203, selec- 
tion rule 201 . 

Simple algebra 311, 313, group 132,
state 189 . 

(sl) coupling 206.

Smekal-Raman effect 224. 

Sommerfeld 193, 236. 

Space, affine, linear, vector 1 ff., linear
sub-s. 2, dual 12, unitary 15 ff.,
Hilbert or function 32, 143, reduction
or decomposition 20, 22, composition
series 122, 135, product 90, tensor
125, 281 ff., group s. 115, 160, re-
presentation 120, 171 ff., algebra as
vector s. 286, 305, system, see System
space.

Space quantization 67, 75, 205. 

Span, space spanned by vectors 3, 20.

Spectrum, atomic, line s. reduced to
term s. 44, of hydrogen and 1-electron
ions 45, in Schrodinger’s theory 69,
in Dirac’s theory 234, of alkalies 85 ff.,
doublets 204, of alkaline earths 207,
246, 3-electron 374, of elements of
periodic table 206 ff., 242; — general
theory, without spin 194, with spin
206 ff., application of Pauli ex-
clusion principle 242 ff., group-
theoretic classification 369 ff., re-
duction into term classes 283 ff., 320 ff.,
calculation of term values 320 ff.;
— molecular 191; — of characteristic
numbers 36.

Spherical harmonics 60 ff., 84, as basis 
of unitary representation in function 
space 142, with spin 230 ff. 

Spin, electron 195 , 196 , 203, as relativ- 
istic phenomenon 204, 217, 222 ff., 
s. moment of momentum 195 , 221, 
magnetic effect 204, 224, s. and 
valence 369 ff. ; — s. perturbation 196, 
203, in Dirac’s theory 222 ff. ; — s. 
quantum number, see under Quantum 
number. 

Stark effect, linear 102. 


State of a physical system, represented
by vector or ray in system space 54,
74 ff., pure 75, 78, mixed 79, of
total system under-determined 92; —
quantum or stationary 43, 56, 80, 188,
simple 189.

Stationary state, see under State. 

Statistical aggregate 78, 239, canonical
79.

Statistics, Bose-Einstein 50. 

Stern-Gerlach effect 65, 75, 205. 

Stieltjes integral 37.

Stoner’s rule 243. 

Sub-algebra, left-invariant 289, (left-
and right-) invariant 167, 311, 314.

Sub-group 116, 334 ff., cyclic 117,
index 118, self-conjugate or invariant
119, maximal invariant 132.

Sub-space 2, 32, invariant, under single
transformation 8, under system of
transformations 122, equivalent or
similar 135, 283, see also Invariant
sub-space.

Substitution 111, see Correspondence.

Sum, see Addition ; — s. rule for influence 
of magnetic field, 209. 

Superposition principle 49. 

Symmetric permutation group, see Per- 
mutation group, symmetric. 

Symmetric transformation in tensor space
282, special 284, Hermitian 283,
unitary 285, enveloping algebra 284,
for arbitrary permutation group 332.

Symmetrization 358.

Symmetry class of tensors 287 , 296, 
primitive 358, of spectral terms 321, 
multiplicity 321, 350 ff., 367. 

Symmetry operator 286 , Young’s 359 . 

Symmetry pattern 358 ff., dual or trans-
posed 361, 368, generated by Young
symmetry operator 359 ff.

System space for translation 54, 74, 195, 
for spin 195, total 185, 196, 347 ff., 
for equivalent individuals 186, 206 ff., 
347 ; — reduction with respect to 
energy 80, moment of momentum 
188, 206, with regard to symmetric 
permutation group 283 ff., 320 ff., 
with regard to Pauli exclusion prin-
ciple 242 ff., 281 ff., 347 ff.

Tensor 125 ff., 139, 281, symmetry
class of t. 287, 338, 358, double t.
347; — t. space 125, 281 ff., symmetric
transformation in t. space 282, in-
variant sub-space 296, reduction 301;
— energy-momentum t. 218.

Term 44, as energy level or character-
istic number 46, 56, 80, see also under
Spectrum, Separation; — t. order,
normal 206.

Thomson, G. P. 50. 

Total quantum number, see under
Quantum number. 

Trace, of matrix or correspondence 11 , 
150, of element of algebra 299 , 317 . 

Transformation, linear 4 = Correspond- 
ence, linear ; — contragredient 12, unit- 
ary 16 , principal 128, symmetric in 
tensor space 282 , for arbitrary per- 
mutation group 332 , special sym- 
metric 284, canonical 96 , in 
quantum mechanics 98 ; — t. to principal 
axes 21 ff., 37 ; — t. group 111, for 
special groups, see under qualifying 
adjective. 

Transition probability 83, 89, in radia- 
tion field 106 ff. 

Translation, left- 116, right- 116.

Translation, electron 195 . 

True quantum number, see under 
Quantum number. 

Uhlenbeck 203. 

Uncertainty principle 77, derivation
393.

Unimodular linear transformation, group 
128. 

Unit, element of group 114, of field 302,
of algebra (modulus or principal unit)
304, basal 168, 304, idempotent
generating 168, 291, independent
292, primitive 293, real 295; — u.
Hermitian form 15.

Unitary correspondence, transformation, 
matrix 16 ff., characteristic numbers 
26, infinitesimal 28 , u. geometry 
15 ff., u. t. as canonical t. of quantum 
mechanics 98, u. representation of 
group 137 ff. 


Unitary group, in 2-space 137 ff., its
unitary representations C_f 137, com-
pleteness 137, 163, 389, character-
istics 151, 163, connection with ro-
tation group d_3 144, augmented 146;
— in n-space 139 ff., reduction of (u)^f
and algebra of symmetric transforma-
tions 285, characteristics 331, 381,
completeness 381.

Unitary-orthogonal system of vectors
or functions 19, 33, completeness 33,
on group manifold 158.

Valence 342, 369, v. electron 86, 243.

Vector, v. space, v. geometry 1 ff., in
Hilbert space 31 ff., v. field 20 , co- 
variant and contravariant 13, absolute 
magnitude 16 , dual 17, scalar pro- 
duct 16 , unitary-orthogonal v. or 
system 16, 19, as element of Abelian 
group 134 ; — 3-v. operator in quantum 
mechanics 197, selection and intensity 
rules 198 ff., complete system of 
orthogonal v. in 3-space 257, v. 
potential of electro-magnetic field 98. 

Vector model of atom, Hund’s 191. 

Velocity, phase and group 53. 

Volume, measure of, on manifold of 
closed continuous group 160, for 
unitary group 386, for unitary uni- 
modular group 162, 389. 

Wave equation, de Broglie’s 53,
Schrodinger’s 54 ff., 101, Dirac’s 213,
218, 225.

Wave field, Heisenberg-Pauli quantiza-
tion of 253 ff.

Wave length 53. 

Wedderburn’s theorem 313. 

Wentzel 74.

Wien 41. 

Wigner 280, 320. 

Wintner 39. 

Young, A. 358. 

Young’s symmetry operator 359 . 

Zeeman effect, normal 85, 101, 193, 198,
anomalous 198, 204, 208, 223, for 
doublets 204, for multiplets in gene- 
ral 208 ff. 








